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THERAPEUTIC AND DIAGNOSTIC AGENTS - H 

The present invention relates generally to therapeutic and diagnostic agents. More 
particularly, the present invention provides therapeutic molecules capable of modulating 
5 signal transduction such as but not limited to cytokine-mediated signal transduction. The 
molecules of the present invention are useful, therefore, in modulating cellular responsiveness 
to cytokines as well as cytokine-mediated immune response mechanisms. 

Bibliographic details of the pubUcations referred to in this specification by author are collected 
10 at the end of the description. Sequence Identity Numbers (SEQ ID NOs.) for the nucleotide and 
amino acid sequences referred to in the specification are defined following the bibliography. 

Throughout this specification and the claims which follow, unless the context requires 
otherwise, the word "comprise", or variations such as "comprises" or "comprising", will be 
15 understood to imply the inclusion of a stated integer or group of integers but not the exclusion 
of any other integer or group of integers. 

Cytokines are secreted proteins that regulate the survival, proliferation, differentiation and 
function of a variety of 'cells within the body [Nicola, 1994]. The haemopoietic cytokines 

20 have in common a four-alpha helical bundle structure and the vast majority interact with a 
structurally related family of cell surface receptors, the type I and type II cytokine receptors 
[Bazan, 1990; Sprang, 1993], In all cases ligand-induced receptor aggregation appears to be 
a critical event in initiating intracellular signal transduction cascades. Some cytokines, for 
example growth hormone, erythropoietin (Epo) and granulocyte colony stimulating factor, 

25 trigger receptor homodimerisation, while for other cytokines, receptor heterodimerisation or 
heterotrimerisation is crucial. In the latter cases, several cytokines share common receptor 
subunits and on this basis can be grouped into three subfamilies with similar patterns of 
intracellular activation and similar biological effects [Hilton, 1994]. Interleuldn-3 (IL-3), IL- 
5 and granulocyte-macrophage colony-stimulatmg factor (GM-CSF) use the common p- 

30 receptor siibimit (pc), and eacli cytokine stimulates the production and functional activity of 
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granulocytes and macrophages. IL-2, IL-4, IL-7, IL-9, and IL-15 each use the commonY- 
chain (yc), while IL-4 and IL^13 share an alternative y-chain (y^c or IL-13 receptor a-chain). 
Each of these cytokines plays an important role in regulating acquired imm unity in the 
lymphoid system. Finally, IL-6, IL-11, leukaemia inhibitory factor (LIF), oncostatin-M 
5 (OSM), ciliary neurotrophic factor (CNTF) and cardiotrophin (CT) share the receptor subunit 
gpl30. Each 6f these cytokines appears to be highly pleiotropic, having effects both within 
and outside the haemopoietic system [Nicola, 1994]. 

In all of the above cases at least one subunit of each receptor complex contains the conserved 

10 sequence elements, termed boxl and box2, in their cytoplasmic tails [Murakami, 1991]. 
Boxl is a proline-rich motif which is located more proximal to the transmembrane domain 
than the acidic box 2 element. The box-1 region serves as the binding site for a class of 
cytoplasmic tyrosine kinases termed JAKs (Janus kinases). Ligand- induced receptor 
dimerisation serves to increase the catalytic activity of the associated JAKs through cross- 

15 phosphorylation. Activated JAKs then tyrosine phosphorylate several substrates, including 
the receptors themselves. Specific phospho tyrosine residues on the receptor then serve as 
docking sites for SH2-containing proteins, the best characterised of which are the signal 
transducers and activators of transcription (STATs) and the adaptor protein, she. The STATs 
are then phosphorylated on tyrosmes, probably by JAKs, dissociate from the receptor and 

20 form either homodimers or heterodimers through the interaction of the SH2 domain of one 
STAT with the phosphotyrosine residue of the other. STAT dimers then translocate to the 
nucleus where they bind to specific cytokine-responsive promoters and activate transcription 
[Darnell, 1994; Ihle, 1995; Ihle, 1995]. In a separate pathway, tyrosine phosphorylated she 
interacts with another SH2 domain-containing protein, Grb-2, leading ultimately to activation 

25 of members of the MAP kinase family and in turn transcription factors such as fos and jun 
[Sato, 1993; Cutler, 1993]. These pathways are not unique to members of the cytokine 
receptor family, with c)^okines that bind receptor tyrosine kinases also being able to activate 
STATs and members of the MAP kmase family [David, 1996; Leaman, 1996; Shual, 1993; 
Sato, 1993; Cutler, 1993]. 
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Four members of the JAK family of cytoplasmic tyrosine kinases have been described, JAKl, 
JAK2, JAK3 and TYK2, each of which binds to a specific subset of cytokine receptor 
subunits. Six STATs have been described (STATl through STAT6), and these too are 
activated by distinct cytokine/receptor complexes. For example, STATl appears to be 
5 functionally specific to the interferon system, STAT4 appears to be specific to IL-12, while 
STAT6 appears to be specific for IL-4 and IL-13. Thus, despite common activation 
mechanisms some degree of cytokine specificity may be achieved through the use of specific 
JAKs and STATs [Thierfelder, 1996; Kaplan, 1996; Takeda, 1996; Shunoda, 1996; Meraz, 
1996; Durbin, 1996]. 

10 

In addition to those described above, there are clearly other mechanisms of activation of these 
pathways. For example, the JAK/STAT pathway appears to be able to activate MAP kinases 
mdependent of the she-induced pathway [David, 1995] and the STATs themselves can be 
activated without bmding to the receptor, possibly by direct interaction with JAKs [Gupta, 
15 1996]. Conversely, full activation of STATS may require the action of MAP kinase in 
addition to that of JAKs [David, 1995; Wen, 1995]. 

While the activation of these signalling pathways is becoming better understood, little is 
known of the negative feedback loops. 

20 

In work leading up to the present invention, the inventors sought to isolate negative regulators 
of signal transduction. The inventors have now identified a new family of SH2 domain- 
containing proteins which are capable of acting as regulators of cytokine signalling. The 
identification of this new family of cytokine regulatory molecules permits the generation of 
25 a range of molecules capable of modulating cytokine signal transduction. The present 
invention provides, therefore, a new family of negative regulators of signal transduction. 

The regulatory molecules of the present invention are referred to as suppressors of cytokine 
signalling or "SOCS". Specific SOCS are defined numerically, for example, SOCSl, S0CS2 
30 an^SOCS'3.^^ 
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Accordingly, one aspect of the present invention provides a nucleic acid molecule comprising 
a sequence of nucleotides encoding or complementary to a sequence encoding an SH2 
domain-containing protein or a derivative thereof which protein suppresses cytokine signal 
5 transduction. 

More particularly, the present invention provides a nucleic acid molecule comprising a 
sequence of nucleotides encoding or complementary to a sequence encoding an SH2 domain- 
containing protein or a derivative thereof which protein suppresses cytokine signal 
10 transduction mediated by at least one of IL-6, LIF, OSM, IFN-y or thrombopoietm. 

Even more particularly, the present invention contemplates a nucleic acid molecule 
comprising a sequence of nucleotides encoding or complementary to a sequence encoding an 
SH2 domain-containing protein or a derivative thereof which protein suppresses cytokine 
15 signal transduction mediated by IL-6 in Ml cells expressing said nucleic acid molecule. 

Another aspect of the present invention provides a nucleic acid molecule comprising a 
sequence of nucleotides encoding or complementary to a sequence encoding an SH2 domain- 
containing protein or a derivative thereof which protein modulates cellular responsiveness to 
20 cytokines. 

More particularly, the present invention is directed to a nucleic acid molecule comprising a 
sequence of nucleotides encoding or complementary to a sequence encoding an SH2 domain- 
containing protein or a derivative thereof which protein modulates cellular responsiveness to 
25 one or more of IL-6, LIF, OSM, IFN-y or thrombopoietin. 

Even more particularly, the present invention relates to a nucleic acid molecule comprising 
a sequence of nucleotides encoding or complementary to a sequence encoding an SH2 
domain-containing protein or a derivative thereof which protein modulates cellular 
30 responsiveness to IL-6 when tested in Ml cells expressing said nucleic acid molecule. 



P:\OPER\EJH\SOCS-n.PRV - 14/2/97 



-5- 

The present invention further contemplates an isolated protein or a derivative thereof 
comprising a sequence of amino acids having an SH2 domain and which is capable of 
regulating cytokine signal transduction, 

5 Another aspect provides an isolated protein or a derivative thereof comprising a sequence of 
amino acids "having an SH2 domain and which is capable of regulating cytokine signal 
transduction for one or more of ILr6, LIF, OSM, IFN-y or thrombopoietin. 

Still another aspect of the present invention is directed to an isolated protein or a derivative 
10 thereof comprising a sequence of amino acids having an SH2 domain and which is capable 
of regulating cytokine signal transduction mediated by IL-6 in Ml cells, expressing said 
protein. 

Yet still another aspect of the present invention provides an isolated protein or a derivative 
15 thereof comprising a sequence of amino acids having an SH2 domain and which is capable 
of modulating cellular responsiveness to cytokines. 

In a related embodiment, the present invention is directed to an isolated protein or a derivative 
thereof comprising a sequence of amino acids having an SH2 domain and which is capable 
20 of modulating cellular responsiveness to one or more of IL-6, LIF, OSM, IFN-y or 
thrombopoietin. 

A further embodiment provides an isolated protein or a derivative thereof comprising a 
sequence of amino acids having an SH2 domain and which is capable of modulating cellular 
25 responsiveness to IL-6 when tested in Ml cells expressing said protein or a mimetic of said 
protein. 

In accordance with these aspects of the present invention, the nucleic acid molecules encode 
a SOCS or a derivative thereof. Reference to a "SOCS" includes reference to family 
30 members ofSbCS such as SOCS 1, S0'CS2 and SOC^^ SOCS have an SH2 domain. They 
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also cx)mprise a C-terminal domain referred to as a C-terminal SOCS domain. Preferably, 
the SOCS is in recombinant form although the present invention extends to a naturally 
occurring SOCS in isolated or purified form. The terms "isolated" and "purified" means that 
a molecule has undergone at least one purification step away from other material. 

5 

Preferably, the nucleic acid molecule is m isolated form and is DNA such as cDNA or 
genomic DNA. The DNA may encode the same amino acid sequence as in the naturally 
occurring SOCS or the SOCS may contain one or more amino acid substitutions, deletions 
and/or additions. The nucleotide sequence may correspond to the genomic coding sequence 
10 or to the nucleotide sequence in cDNA of the naturally occurring mRNA or may carry one 
or more nucleotide substitutions, deletions and/or additions thereto. 

In one embodiment, the nucleotide sequence of the SOCS, regardless of from which sub- 
family, comprises the sequence set forth in SEQ ID NO:3 or SEQ ID NO:9 or SEQ ID 
15 NO: 11 or has at least 15% homology thereto and/or is capable of hybridising the nucleotide 
sequence set forth in SEQ ID NO: 3 under low stringency conditions. The nucleotide 
sequence set forth in SEQ ID NO: 3 encodes mouse SOCSl. SEQ ID NO: 9 encodes human 
SOCS 1 and SEQ ID NO: 1 1 encodes rat SOCS 1 . 

20 In another embodiment, the nucleotide sequence of the SOCS, regardless of from which sub- 
family, comprises the sequence set forth m SEQ ID NO: 5 or has at least 15% homology 
thereto and/or is capable of hybridising the nucleotide sequence set forth in SEQ ID NO:5 
xmder low strmgency conditions. The nucleotide sequence set forth m SEQ ID NO:5 encodes 
mouse S0CS2, 

25 

In a fiirther embodiment, the nucleotide sequence of the SOCS, regardless of from which sub- 
family, comprises the sequence set forth in SEQ ID N0:7 or has at least 15% homology 
thereto and/or is capable of hybridising the nucleotide sequence set forth in SEQ ID NO:7 
under low stringency conditions. The nucleotide sequence set forth in SEQ ID NO:7 encodes 
30 mouse SOCSS. 
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Within a sub-family, i.e. intra-sub-family, the level of homology is higher, such as at least 
about 35%. 

The present invention further contemplates a nucleotide sequence encodmg SOCSl 
5 comprising a sequence set forth in SEQ ID NO:3 or SEQ ID NO:9 or SEQ ID NO: 11 or 
having at leaSt 35 % homology thereto and/or is capable of hybridising to the nucleotide 
sequence of SEQ ID N0:3 or SEQ ID N0:9 and SEQ ID NO: 11 under low strmgency 
conditions. 

10 The present invention further contemplates a nucleotide sequence encoding mouse SOCS2 
comprising a sequence set forth in SEQ ID NO:5 or havmg at least 35% homology thereto 
and/or is capable of hybridising to the nucleotide sequence of SEQ ID NO:5 under low 
stringency conditions. 

15 The present invention further contemplates a nucleotide sequence encoding mouse SOCS3 
comprising a sequence set forth in SEQ ID N0:7 or having at least 35% homology thereto 
and/or is capable of hybridising to the nucleotide sequence of SEQ ID NO:7 under low 
stringency conditions. 

20 Reference herein to a low stringency at 42 "^C includes and encompasses from at least about 1% 
v/v to at least about 15% v/v formamide and from at least about IM to at least about 2M salt 
for hybridisation, and at least about IM to at least about 2M salt for washing conditions. 
Altemative stringency conditions may be appUed where necessary, such as medium stringency, 
which includes and encompasses from at least about 16% v/v to at least about 30% v/v 

25 formamide and from at least about 0.5M to at least about 0.9M salt for hybridisation, and at 
least about 0.5M to at least about 0.9M salt for washing conditions, or high stringency, which 
includes and encompasses from at least about 3 1% v/v to at least about 50% v/v formamide and 
from at least about 0.0 IM to at least about 0.1 5M salt for hybridisation, and at least about 
O.OIM to at least about 0.1 5M salt for washing conditions. 
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In a related embodiment, the nucleotide sequence encodes an amino acid sequence as set forth 
in SEQ ID NO:4 (referred to herein as SOCSl) or has at least about 15% similarity thereto. 

In a further related embodiment, the nucleotide sequence encodes an amino acid sequence as 
5 set forth in SEQ ID NO: 6 (referred to herein as mouse SOCS2) or has at least about 15% 
similarity thefeto. 

In another related embodiment, the nucleotide sequence encodes an amino acid sequence as 
set forth in SEQ ID NO: 8 (referred to herein as mouse SOCS3) or has at least about 15% 
10 similarity thereto. 

In yet another related embodiment, the nucleotide sequence encodes an amino acid sequence 
as set forth in SEQ ID NO: 10 (referred to herein as human SOCSl) or has at least about 15 % 
similarity thereto. 

15 

In yet a further related embodiment, the nucleotide sequence encodes an amino acid sequence 
as set forth in SEQ ID NO: 12 (referred to herem as rat SOCSl) or has at least about 15% 
similarity thereto. 

20 The above sequence comparisons are preferably to the whole molecule but may also be to part 
thereof. Preferably, the comparisons are made to a contiguous series of at least about 21 
nucleotides or at least about 7 amino acids. 

Still another embodiment of the present invention contemplates an isolated polypeptide 
25 comprising a sequence of amino acids substantially as set forth in SEQ ID NO:4 or having 
at least about 15 % similarity thereto. 

Still another embodiment of the present invention contemplates an isolated polypeptide 
comprising a sequence of amino acids substantially as set forth in SEQ ID NO: 6 or having 
30 at feast about"! 5 % similarity "^thereto. ' 
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Still another embodiment of the present invention contemplates an isolated polypeptide 
comprising a sequence of amino acids substantially as set forth in SEQ ID NO: 8 or having 
at least about 15% similarity thereto. 

5 Still another embodiment of the present invention contemplates an isolated polypeptide 
comprising a gbquence of amino acids substantially as set forth in SEQ ID NO: 10 or having 
at least about 15% similarity thereto. 

Still another embodiment of the present invention contemplates an isolated polypeptide 
10 comprising a sequence of amino acids substantially as set forth in SEQ ID NO: 12 or having 
at least about 15 % similarity thereto. 

Preferred nucleotide percentage similarities include at least about 20%, at least about 40%, 
at least about 50%, at least about 60%, at least about 70%, at least about 80%, at least about 
15 90% or above such as 93%, 95%, 98% or 99%. 

Preferred amino acid similarities include at least about 20%, at least about 30%, at least about 
40%, at least about 50%, at least about 60%, at least about 70%, at least about 80%, at least 
about 90%, at least about 95%, at least about 97% or 98% or above. 

20 

The term "similarity" includes exact identity between sequences or, where the sequence 
differs, different amino acids are related to each other at the structural, functional, 
biochemical and/or conformational levels. 

25 The nucleic acid molecule may be isolated from any animal such as humans, livestock animals 
(e.g. horses, cows, sheep, donkeys, pigs), laboratory test animals (e.g. mice, rats, rabbits, 
hamsters, guinea pigs or primates), companion animals (e.g. dogs, cats) or captive wild 
animals (e.g. deer, foxes, kangaroos, monkeys and other primates). 

30 The terms '"derivatives" or its* singular form "derivative" whether in relation to a nucleic acid 
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molecule or a protein includes parts, mutants, fragments and zmalogues as well as hybrid or 
fusion molecules and glycosylation variants. Preferably, the derivatives have functional 
activity or alternatively act as antagonists or agonists, 

5 One example of an antagonist is an antisense oligonucleotide sequence. Useful 
oligonucleotides are those which have a nucleotide sequence complementary to at least a 
portion of the protein-coding or "sense" sequence of the nucleotide sequence. These anti- 
sense nucleotides can be used to effect the specific inhibition of gene expression. The 
antisense approach can cause inhibition of gene expression apparently by forming an anti- 
10 parallel duplex by complementary base pairing between the antisense construct and the 
targeted mRNA, presumably resulting in hybridisation arrest of translation.. Ribozymes and 
co-suppression molecules may also be used. Antibodies may also act as either antagonists or 
agonists. 

1 5 The present invention extends to analogues of the SOCS proteins of the present invention and 
their use in the treatment or prophylaxis of cytokine mediated dysfunction such as 
autoimmunity, immune suppression or hyperactive immunity or other condition including but 
not limited to dysfunctions in the haemopoietic, endocrine, hepatic and neural systems. 
Analogues of the proteins contemplated herein include, but are not limited to, modification to 

20 side chains, incorporating of unnatural amino acids and/or their derivatives during peptide, 
polypeptide or protein synthesis and the use of crosslinkers and other methods which impose 
conformational constraints on the proteinaceous molecule or their analogues. 

Examples of side chain modifications contemplated by the present invention include 
25 modifications of amino groups such as by reductive alkylation by reaction with an aldehyde 
followed by reduction with NaBH4; amidination with methylacetimidate; acylation with acetic 
anhydride; carbamoylation of amino groups with cyanate; trinitrobenzylation of amino groups 
with 2, 4, 6-trinitrobenzene sulphonic acid (TNBS); acylation of amino groups with succinic 
anhydride and tetrahydrophthalic anhydride; and pyridoxylation of lysine with pyridoxal-5- 
30 phosphate followed by reduction \Vith'NaBH4. 
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The guanidine group of arginine residues may be modified by the formation of heterocyclic 
condensation products with reagents such as 2,3-butanedione, phenylglyoxal and glyoxal. 

The carboxyl group may be modified by carbodiimide activation via O-acylisourea formation 
5 followed by subsequent derivitisation, for example, to a corresponding amide. 

Sulphydryl groups may be modified by methods such as carboxymethylation with iodoacetic 
acid or iodoacetamide; performic acid oxidation to cysteic acid; formation of a mixed 
disulphides with other thiol compounds; reaction with maleimide, maleic anhydride or other 
10 substituted maleimide; formation of mercurial derivatives using 4-cliloromercuribenzoate, 4- 
chloromercuriphenylsulphonic acid, phenylmercury chloride, 2-chloromercuri-4-nitrophenol 
and other mercurials; carbamoylation with cyanate at alkaline pH. 

Tryptophan residues may be modified by, for example, oxidation with N-bromosuccinimide or 
15 alkylation of the indole ring with 2-hydroxy-5-nitrobenzyl bromide or sulphenyl halides. 
Tyrosine residues on the other hand, may be altered by nitration with tetranitromethane to form 
a S-nitrotyrosine derivative. 

Modification of the imidazole ring of a histidine residue may be accomplished by alkylation 
20 with iodoacetic acid derivatives or N-carbethoxylation with diethylpyrocarbonate. 

Examples of incorporating unnatural amino acids and derivatives during peptide synthesis 
include, but are not Umited to, use of norleucine, 4-amino butyric acid, 4-amino-3 -hydroxy- 5- 
phenylpentanoic acid, 6-aminohexanoic acid, t-butylglycine, norvaline, phenylglycine, 
25 ornithine, sarcosine, 4-amino-3-hydroxy-6-methylheptanoic acid, 2-thienyl alanine and/or D- 
isomers of amino acids. A list of unnatural amino acid, contemplated herein is shown in Table 
L 

Crosslinkers can be used, for example, to stabilise 3D conformations, using homo-bifunctional 
30 crosslinkers such as the bifunctional imido esters having (CH2)n spacer groups with n=l to 
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n=6, glutaraldehyde, N-hydroxysuccinimide esters and hetero-bifunctional reagents which 
usually contain an amino-reactive moiety such as N-hydroxysuccinimide and another group 
specific-reactive moiety such as maleimido or dithio moiety (SH) or carbodiimide (COOH). 
In addition, peptides can be confonnationally constrained by, for example, incorporation of C„ 
5 and N^-methylamino acids, introduction of double bonds between and atoms of amino 
acids and the formation of cyclic peptides or analogues by introducing covalent bonds such as 
forming an amide bond between the N and C termini, between two side chains or between a 
side chain and the N or C terminus. 

10 These types of modifications may be important to stabilise the cytokines if administered to an 
individual or for use as a diagnostic reagent. 

TABLE 1 



15 Non-conventional Code Non-conventional Code 

amino acid amino acid 



a-aminobutyric acid 


Abu 


L-N-methylalanine 


Nmala 


a-amino-a-methylbutyrate 


Mgabu 


L-N-methylarginine 


Nmarg 


aminocyclopropane- 


Cpro 


L-N-methylasparagine 


Nmasn 


carboxylate 




L-N-methylaspartic acid 


Nmasp 


aminoisobutyric acid 


Aib 


L-N-methylcysteine 


Nmcys 


aminonorbomyl- 


Norb 


L-N-methylglutamine 


Nmgln 


carboxylate 




L-N-methylglutamic acid 


Nmglu 


cyclohexylalanine 




Chexa L-N-methylhistidine 


Nmhis 


cyclopentylalanine 


Cpen 


L-N-methylisoUeucine 


Nmile 


D-alanine 


Dal 


L-N-methylleucine 


Nmleu 


D-arginine 


Darg 


L-N-methyllysine 


Nmlys 


D-aspartic acfd 


t)asp " 


L-N-methylmethionine 


Nmmet 
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D-cvsteine 




I-^ l>(--I-LlCUJ.y iiUJl iCUL/iliC 


"Mm nl ^ 
INiiiillC 


D-fflutamine 




J— ' 1^ lXiCUJ.yiJJ.iJl VcLilllC 


^Ttn TW7Q 

IN Hill V a. 




Dalu 


T — ^J^-m p1"Vi\/lomifliin#a 
i>( iiidJi y iVJi III Liiiiic 


IN mui U 


D-hi*5tidinp 


Dhi«; 


J-/~i^ "iii C UJ ipilCliy l<lldXLLIlC 


iNjnpnc 


5 D-i<inlpiirinp 


Dile 


JL-/ l>i iiiCLU y iUi Uiiiiw 


IN mpi O 


D -leucine ^ 


Dleu 


T .-'N^-tn pfHvl Qprinp 


'N^m Qpr 

1 » 1 1 1 0 CI 


D-lvsine 


Dlys 


T^-T^-m pfh vltVirpnninp 

X^ XXX^IXX y X LXXX wUXXJ-XX w 


TsJm thr 

IN ill nil 


-m f*th i n n 1 n f* 

1 -/ XJ.J. LI JJ. Vw^XXXi.J.V^ 


Dmpt 


"iiicui y i LI yp L'VJ uiidii 


iNIULrp 


D -omittdne 


Dom 


T .-N-m pth vl tvrn<;inp 

x^ x^ xxxwLxx y X L y X w^xxxw 


1 >( 111 uy 1 


10 D-phenylalanine 


Dphe 


L-N-m ethyl valine 


Nmval 


D-proline 


Dpro 


L-N-m ethyl ethy Igly cine 


Nmetg 


D-serine 


Dser 


T\-TsJ-rn pfh vl -t-hiitvl 1 vci n p 

x^ xxx^Lxxyx V i^uvv x^x y wxx^w 


thi 1 cr 

1 >111UL/U^ 


D -threonine 


Dthr 


L-norleiicine 


Nle 

X ^ X w 


D-tryptophan 


Dtrp 


L-norvaline 


X N V Cl 


1 5 D-tyrosine 


Dtyr 


ff- m eth V 1 - am i n o i s ohu tvrat e 

Ur xxxvrLxx y X <n 1 1 xxji.v/Xijv^ ix j. y x ex u w 


XVXCXXL' 


D-valine 


Dval 


tt-m eth vl - v-am i n nhi it vratp 

Ur XXXwiXX y X V CULXXXLXVJ 1./ LX t y X dlrw 


era nil 

IVX^CIL' Li 


D - ct-m ethyl al anin e 


Dmala 


rt-TTiPthvlp.vrJnhpwlfil aninp 

u» xxxwLxx y Xw y wXva^xx^A. y xcxxcLXXXXXw 


X/fpHpYfl 

1V±V>'11CA.CX 


D- cc-methy larginine 


Dmarg 


tt-m eth vl cvl con ent vl al an i n e 


A/fcnpn 

XVAVL/ ^XX 


D - cc-m ethy lasp aragine 


Dmasn 


tt-m eth vl-tt-nanthvl al an i n e 


XVXfl 1 i fl L/ 


20 D-a-m ethyl aspartate 


Dmasp 


tt-meth V It) eni ci 1 1 am i n e 

%M0 XXXKi/lrtl^ y f Sr* * ^ ' V* f 1 1 till 1 1 1 1 J ^* 


N4pen 


D-a-methylcysteine 


Dmcys 


N-(4-aminobutyl)glycine 


Nelu 


D-a-methylglutamine 


Dmgln 


N-(2-amino ethy l)gly cine 


Naee 


D-a-methylhistidine 


Dmhis 


N-(3 -aminoprop ynel vcine 


Norn 


D-a-methylisoleucine 


Dmile 


N-amino- tt-m ethy Ibutyrate 


Nmaabu 


j^-a-iiicuiyiicuuiiic 




tt-napthylalanine 


Anap 


D-a-methyllysine 


Dmlys 


N-b enzy Igly cine 


Nphe 


D-a-methylmethionine 


Dm met 


N-(2-carbamylethyl)glycine 


Nghi 


D-a-methylomithine 


Dmom 


N-(carbamylmethyl)glycine 


Nasn 


D-a-methylphenylalanine 


Dmphe 


N-(2-carboxyethyl)glycine 


Nglu 


30 D-a-methylproline 


Dmpro 


N-(carboxymethyl)glycine 


Nasp 
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D-a-methylserine 


Dmser 


N-cyclobutylglycine 


Ncbut 


D-a-methylthreonine 


Dmthr 


N-cycloheptylglycine 


Nchep 


D-a-methyltryptophan 

J J IT ST 


Dmtrp 


N-cyclohexylglycine 


Nchex 


D-a-methyltyrosine 


Dmty 


N-cyclodecylglycine 


Ncdec 


5 D-a-methylvaline 


Dmval 


N- cy Ico do d ecy Igly cine 


Ncdod 


D -N-m ethy laftnine 


Dnmala 


N-cyclooctylglycine 


Ncoct 


D-N-methylarginine 


Dnmarg 


N-cvclooroovlfflvcine 


Ncpro 


D-N-m ethyl asp airagine 


Dnmasn 


N-cvcloundecvlfflvcine 


Ncund 


D-N-methylaspartate 


Dnmasp 


N-(2,2-diphenylethyl)glyciiie 


Nbhm 


10 D-N-methylcysteine 


Dnmcys 


N-C3,3-diphenvlDropvl)slvcine 


Nbhe 


D-N-methylglutamine 


Dnmgln 


N-(3-guanidinopropyl)glycine . 


Narg 


D-N-methylglutamate 


Dnmglu 


N-( 1 -hydroxy ethyl)gly cine 


Nthr 


D-N-methylhistidine 


Dnmhis 


N-(hydroxyethyl))glycine 


Nser 


D-N-methylisoleucine 


Dnmile 


N-(imidazolylethyl))glycLne 


Nhis 


15 D-N-m ethylleucine 


Dnmleu 


N-(3 -indolyly ethyl)gly cine 


Nhtrp 


D-N-methyllysine 


Dnmlys 


N-methyl-y-aminobutyrate 


Nmgabu 


N-methylcyclohexylalanine 


Nmchexa 


D-N-methylmethionine 


Dnmmet 


D-N-methylomitliine 


Dnmorn 


N-methylcyclopentylalanine 


Nmcpen 


N-methylglycine 


Nala 


D-N-methylphenylalanine 


Dnmphe 


20 N-methylaminoisobutyrate 


Nmaib 


D-N-methylproline 


Dnmpro 


N-(l -methylpropyl)glycine 


Nile 


D-N-methylserine 


Dnmser 


N-(2-methylpropyl)glycine 


Nleu 


D-N-methylthreonine 


Dnmthr 


D-N-methyltryptophan 


Dnmtrp 


N-(l-methylethyl)glycine 


Nval 


D-N-methyltyrosine 


Dnmtyr 


N-methyla-napthylalanine 


Nmanap 


25 D-N-methylvaline 


Dnmval 


N-m eth vl nen i ci 1 1 am i n p 


Nmnpn 


Y-aminobutyric acid 


Gabu 


N-(p-hydroxyphenyl)glycine 


Nhtyr 


L-/-butylglycine 


Tbug 


N-(thiomethyl)glycine 


Ncys 


L-ethylglycine 


Etg 


penicillamine 


Pen 


L-homophenylalanine 


Hphe 


L-a-methylalanine 


Mala 


30 L-a-methylarginme 


fvlarg" 


L-a-met±iylasparagine 


Masn 



P:\OPER\EJH\SOCS-n.PRV - 14/2/97 



- 15 - 



L-a-methylaspartate 


Masp 


L-C£-methvl-/-hiitvl0lvcine 


A4tbviff 


L-a-methyl cysteine 


Mcys 


L-meth vlethvl &1 vcine 


Nletff 


L-a-methylglutamiiie 


Meln 


L - tt - m eth V 1 1 u t am at e 


Mfflii 

IVX^lLi 


L- a-m ethy Ihistidine 


Mhis 


L- tt-niethy Ihoniopheny lalaniiie 


Mhnhe 


L-cc-methylisoleucixie 


Mile 


N- ( 2-m eth vl th i n eth vH vni n p 

J- ^ \^ lAx^tii y luiiiw tijj. y A jgiy wxii^ 


'N^m pt 

1 ^ lll&L 


L-tt-methvlleiiciiie 


Mleu 


L- tt-m etVi vl 1 v<5i n e 

u» iiiwui y 11 y 3111^ 


IV/Tl vc 


- ff -m eth vl m eth i on i n e 

-1— / U« M i I \J h.1 I Jf till \^ 1-1 llV-flllll \J 


Mxnet 


T .-rt-rnpfVi vlnnrlpiif^inp 

±^ %A XXl^LLl Ylll\Jl ICLlwlUw 


TV^n 1 f> 
1VJJJJ.C 


T -rf-mptVivlnnrvfllinp 


ivjxi V a 


T — mptV»\7lr\"mi+fiiTip 
i-* ix~ JJJ. c iiiy lu 1 JJJ^ uiiii c 


Mom 


T -nt-mpthvlnHpTivlalaniTiR 


IVAjJilC 




^4pro 


T -lY-m PtVivl^prinp 
u> 1X1 wLu. y lo^x 




i-/~L6-iJJCLliyiU-ll CvJlLLIlC 


iviinr 


L-a-methyltryptophan 


Mtrp 


L-a-methyl tyrosine 


Mtyr 


L-a-methylvaline 


Mval 


L-N-methylhomophenylalanine 


Nmhphe 


N-(N-(2,2-diphenylethyl) 


Nnbhm 


N-(N-(3,3-diphenylpropyl) 


Nnbhe 


carbamylmethyl)glycine 




carb amy Im ethy l)gly cine 




1 -carboxy- 1 -(2,2-diphenyI- 


Nmbc 






ethylamino)cyclopropane 









Other derivatives conteinplated by the present invention include a range of glycosylation 
20 variants from a completely unglycosylated molecule to a modified glycosylated molecule. 
Altered glycosylation patterns may result from expression of recombinant molecules in 
different host cells. 

Another embodiment of the present invention contemplates a method for modulating 
25 expression of a SOCS protein in a maromal, said method comprising contacting a gene 
encoding a SOCS or a factor/element involved in controlling expression of the SOCS gene 
with an effective amount of a modulator of SOCS expression for a time and under conditions 
sufficient to up-regulate or down-regulate or otherwise modulate expression of SOCS. An 
example of a modulator is a cj^tokine such as IL-6 or other transcription regulators of SOCS 
30 expression. ' ^ ' ***** ' 
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Expression includes transcription or translation or both. 

Another aspect of the present invention contemplates a method of modulating activity of 
SOCS in a human, said method comprising administering to said maromal a modulating 
5 effective amount of a molecule for a time and under conditions sufficient to increase or 
decrease SOCS activity. The molecule may be a proteinaceous molecule or a chemical entity 
and may also be a derivative of SOCS or a chemical analogue or truncation mutant of SOCS. 

A further aspect of the present invention provides a method of inducing synthesis of a SOCS 
10 or transcription/translation of a SOCS comprising contacting a cell containing a SOCS gene 
with an effective amount of a cytokine capable of inducing said SOCS for a time and under 
conditions sufficient for said SOCS to be produced. For example, SOCS 1 may be induced by 
IL-6. 

1 5 A further aspect of the present invention contemplates a range of mimetics or small molecules 
capable of acting as agonists or antagonists of the SOCS. Such molecules may be obtained 
from natural product screening such as from coral, soil, plants or the ocean or antarctic 
environments. Altematively, peptide, polypeptide or protein hbraries may be readily screened. 
For example. Ml cells expressing a SOCS do not undergo proliferation in the presence of IL- 

20 6, This system can be used to screen molecules which permit proliferation in the presence of 
IL-6 aad a SOCS. A range of test cells may be prepared to screen for antagonists and agonists 
for a range of cytokines. Such molecules are preferably small molecules and may be of amino 
acid origin or of chemical origin. 

25 Accordingly, the present invention contemplates a pharmaceutical composition comprising 
SOCS or a derivative thereof or a modulator of SOCS expression or SOCS activity and one 
or more pharmaceutically acceptable carriers and/or diluents. These components are referred 
to as the "active ingredients". 

30 Although these and other aspects of the present invention apply to any SOCS, one preferred 
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SOCS is SOCSL Other preferred SOCS molecules include SOCS2 and SOCS3. 

The pharmaceutical forms suitable for injectable use include sterile aqueous solutions (where 
water soluble) or dispersions and sterile powders for the extemporaneous preparation of sterile 
5 injectable solutions or dispersion or may be in the form of a cream or other form suitable for 
topical application. It must be stable under the conditions of manufacture and storage and 
must be preserved against the contaminating action of microorganisms such as bacteria and 
fungi. The carrier can be a solvent or dispersion medium containing, for example, water, 
ethanol, polyol (for example, glycerol, propylene glycol and liquid polyethylene glycol, and 

10 the like), suitable mixtures thereof, and vegetable oils. The proper fluidity can be maintained, 
for example, by the use of a coating such as licithin, by the maintenance of the required 
particle size in the case of dispersion and by the use of superfactants. The preventions of the 
action of microorganisms can be brought about by various antibacterial and antifungal agents, 
for example, parabens, chlorobutanol, phenol, sorbic acid, thirmerosal and the like. In many 

15 cases, it will be preferable to include isotonic agents, for example, sugars or sodium chloride. 
Prolonged absorption of the injectable compositions can be brought about by the use in the 
compositions of agents delaying absorption, for example, aluminum monostearate and gelatin. 

Sterile injectable solutions are prepared by incorporating the active compounds in the required 
20 amount in the appropriate solvent with various of the other ingredients enumerated above, as 
required, followed by filtered sterilization. Generally, dispersions are prepared by 
incorporating the various sterilized active ingredient into a sterile vehicle which contains the 
basic dispersion medixmi and the required other ingredients from those enumerated above. In 
the case of sterile powders for the preparation of sterile injectable solutions, the preferred 
25 methods of preparation are vacuum drying and the freeze-drying technique which yield a 
powder of the active ingredient plus any additional desired ingredient from previously 
sterile-filtered solution thereof. 

When the active ingredients are suitably protected they may be orally administered, for 
30 exMciple, withT an inert diluent or vvii an assimilable edible carrier, or it may be enclosed in 
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hard or soft shell gelatin capsule, or it may be compressed into tablets, or it may be 
incorporated directly with the food of the diet. For oral therapeutic administration, the active 
compound may be incorporated with excipients and used in the form of ingestible tablets, 
buccal tablets, troches, capsules, elixirs, suspensions, syrups, wafers, and the like. Such 
5 compositions and preparations should contain at least 1% by weight of active compound. The 
percentage of the compositions and preparations may, of course, be varied and may 
conveniently be between about 5 to about 80% of the weight of the unit. The amount of active 
compound in such therapeutically useful compositions in such that a suitable dosage will be 
obtained. Preferred compositions or preparations according to the present invention are 
10 prepared so that an oral dosage unit form contains between about 0.1 /zg and 2000 mg of 
active compound. 

The tablets, troches, pills, capsules and the like may also contain the components as listed 
hereafter: A binder such as gum, acacia, com starch or gelatin; excipients such as dicalcium 

1 5 phosphate; a disintegrating agent such as com starch, potato starch, alginic acid and the like; 
a lubricant such as magnesium stearate; and a sweetening agent such a sucrose, lactose or 
saccharin may be added or a flavouring agent such as peppermint, oil of wintergreen, or cherry 
flavouring. When the dosage unit form is a capsule, it may contain, in addition to materials 
of the above type, a liquid carrier. Various other materials may be present as coatings or to 

20 otherwise modify the physical form of the dosage unit. For instance, tablets, pills, or capsules 
may be coated with shellac, sugar or both. A syrup or elixir may contain the active compound, 
sucrose as a sweetening agent, methyl and propylparabens as preservatives, a dye and 
flavouring such as cherry or orange flavour. Of course, any material used in preparing any 
dosage unit form should be pharmaceutically pure and substantially non-toxic in the amounts 

25 employed. In addition, the active compound(s) may be incorporated into sustained-release 
preparations and formulations. 

The present invention also extends to forms suitable for topical application such as creams, 
lotions and gels. 
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Phannaceutically acceptable carriers and/or diluents include any and all solvents, dispersion 
media, coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents 
and the like. The use of such media and agents for pharmaceutical active substances is well 
known in the art. Except insofar as any conventional media or agent is incompatible with the 
5 active ingredient, use thereof in the therapeutic compositions is contemplated. Supplementary 
active ingredients can also be incorporated into the compositions. 

It is especially advantageous to formulate parenteral compositions in dosage unit form for ease 
of administration and uniformity of dosage. Dosage unit form as used herein refers to 

10 physically discrete units suited as unitary dosages for the mammalian subjects to be treated; 
each unit containing a predetermined quantity of active material calculated to produce the 
desired therapeutic effect in association with the required pharmaceutical carrier. The 
specification for the novel dosage unit forms of the invention are dictated by and directly 
dependent on (a) the unique characteristics of the active material and the particular therapeutic 

15 effect to be achieved, and (b) the hmitations inherent in the art of compounding such an active 
material for the treatment of disease in living subjects having a diseased condition in which 
bodily health is impaired as herein disclosed in detail. 

The principal active ingredient is compounded for convenient and effective administration in 
20 effective amounts with a suitable pharmaceuticaliy acceptable carrier in dosage unit form as 
hereinbefore disclosed. A unit dosage form can, for example, contain the principal active 
compound in amoxmts ranging from 0.5 \ig to about 2000 mg. Expressed in proportions, the 
active compound is generally present in from about 0.5 ^g to about 2000 mg/ml of carrier. 
In the case of compositions containing supplementary active ingredients, the dosages are 
25 determined by reference to the usual dose and manner of administration of the said 
ingredients. 

The pharmaceutical composition may also comprise genetic molecules such as a vector 
capable of transfecting target cells where the vector carries a nucleic acid molecule capable 
30 of modulating SOCS expression or SOCS activity. The vector may, for example, be a viral 
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vector. 

Still another aspect of the present invention is directed to antibodies to SOCS and its 
derivatives. Such antibodies may be monoclonaTor polyclonal and may be selected from 
5 naturally occurring antibodies to SOCS or may be specifically raised to SOCS or derivatives 
thereof In the'^case of the latter, SOCS or its derivatives may first need to be associated with 
a carrier molecule. The antibodies and/or recombinant SOCS or its derivatives of the present 
invention are particularly useful as therapeutic or diagnostic agents. 

10 For example, SOCS and its derivatives can be used to screen for naturally occurring antibodies 
to SOCS. These may occur, for example in some autoimmune diseases. > Alternatively, 
specific antibodies can be used to screen for SOCS. Techniques for such assays are well 
known in the art and include, for example, sandwich assays and ELISA. Knowledge of SOCS 
levels may be important for diagnosis of certain cancers or a predisposition to cancers or 

1 5 monitoring cytokine mediated cellular responsiveness or for monitoring certain therapeutic 
protocols. 

Antibodies to SOCS of the present invention may be monoclonal or polyclonal. 
Altematively, fragments of antibodies may be used such as Fab fragments. Furthermore, the 
20 present invention extends to recombinant and synthetic antibodies and to antibody hybrids, 
A "synthetic antibody" is considered herein to include fragments and hybrids of antibodies. 
The antibodies of this aspect of the present invention are particularly useful for 
immunotherapy and may also be used as a diagnostic tool for assessing apoptosis or 
monitoring the program of a therapeutic regimin. 

25 

For example, specific antibodies can be used to screen for SOCS proteins. The latter would 
be important, for example, as a means for screening for levels of SOCS in a cell extract or 
other biological fluid or purifying SOCS made by recombinant means from culture 
supernatant fluid. Techniques for the assays contemplated herein are known in the art and 
30 include, for example, sandwich assays' and ELISA. 
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It is within the scope of this invention to include any second antibodies (monoclonal, 
polyclonal or fragments of antibodies or synthetic antibodies) directed to the first mentioned 
antibodies discussed above. Both the first and second antibodies may be used in detection 
assays or a first antibody may be used with a commercially available anti-immunoglobulin 
5 antibody. An antibody as contemplated herein includes any antibody specific to any region 
ofSOCS. ^ 

Both polyclonal and monoclonal antibodies are obtainable by immunization with the enzyme 
or protein and either type is utilizable for immimoassays. The methods of obtaining both types 

10 of sera are well known in the art. Polyclonal sera are less preferred but are relatively easily 
prepared by injection of a suitable laboratory animal with an effective amount of SOCS, or 
antigenic parts thereof, collecting serum from the animal, and isolating specific sera by any 
of the known immunoadsorbent techniques. Although antibodies produced by this method are 
utilizable in virtually any type of immunoassay, they are generally less favoured because of 

15 the potential heterogeneity of the product. 

The use of monoclonal antibodies in an immunoassay is particularly preferred because of the 
ability to produce them in large quantities and the homogeneity of the product. The 
preparation of hybridoma cell lines for monoclonal antibody production derived by fusing an 
20 immortal cell line and lymphocytes sensitized against the immunogenic preparation can be 
done by techniques which are well known to those who are skilled in the art. 

Another aspect of the present invention contemplates a method for detecting SOCS in a 
biological sample from a subject said method comprisiog contacting said biological sample 
25 with an antibody specific for SOCS or its derivatives or homologues for a time and under 
conditions sufficient for an antibody-SOCS complex to form and then detecting said complex. 

The presence of SOCS may be accompUshed in a number of ways such as by Western blotting 
and ELISA procedures. A wide range of immunoassay techniques are available as can be 
30 seen by reference to US Patent Nos. 4,016,043, 4, 424,279 and 4,018,653. These, of course. 
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include both single-site and two-site or "sandwich" assays of the non-competitive types, as 
well as in the traditional competitive binding assays. These assays also include direct binding 
of a labelled antibody to a target. 

5 Sandwich assays are among the most useful and commonly used assays and are favoured for 
use in the present invention. A number of variations of the sandwich assay technique exist, 
and all are intended to be encompassed by the present invention. Briefly, in a typical forward 
assay, anunlabelled antibody is immobilized on a solid substrate and the sample to be tested 
brought into contact with the bound molecule. After a suitable period of incubation, for a 

10 period of time sufficient to allow formation of an antibody-antigen complex, a second 
antibody specific to the antigen, labelled with a reporter molecule capable of producing a 
detectable signal is then added and incubated, allowing time sufficient for the formation of 
another complex of antibody-antigen-labelled antibody. Any unreacted material is washed 
away, and the presence of the antigen is determined by observation of a signal produced by 

15 the reporter molecule. The results may either be qualitative, by simple observation of the 
visible signal, or may be quantitated by comparing with a control sample containing known 
amounts of hapten. Variations on the forward assay include a simultaneous assay, in which 
both sample and labelled antibody are added simultaneously to the bound antibody. These 
techniques are well known to those skilled in the art, including any minor variations as will 

20 be readily apparent. In accordance with the present invention the sample is one which might 
contain SOCS including cell extract, tissue biopsy or possibly serum, saliva, mucosal 
secretions, lymph, tissue fluid and respiratory fluid. The sample is, therefore, generally a 
biological sample comprising biological fluid but also extends to fermentation fluid and 
supernatant fluid such as from a cell culture. 

25 

In the typical forward sandwich assay, a first antibody having specificity for the SOCS or 
antigenic parts thereof, is either covalently or passively bound to a solid surface. The solid 
surface Is typically glass or a polymer, the most commonly used polymers being cellulose, 
polyacrylamide, nylon, polystyrene, polyvinyl chloride or polypropylene. The solid supports 
30 may be in the form of tubes, beads, discs of microplates, or any other surface suitable for 
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conducting an immunoassay. The binding processes are well-known in the art and generally 
consist of cross-linking covalently binding or physically adsorbing, the polymer-antibody 
complex is washed in preparation for the test sample. An aliquot of the sample to be tested 
is then added to the soUd phase complex and incubated for a period of time sufficient (e.g. 2- 
5 40 minutes) and under suitable conditions (e.g. 25 ^^C) to allow binding of any subunit present 
in the antibody"; Following the incubation period, the antibody subunit solid phase is washed 
and dried and incubated with a second antibody specific for a portion of the hapten. The 
second antibody is linked to a reporter molecule which is used to indicate the binding of the 
second antibody to the hapten, 

10 
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An alternative method involves immobilizing the target molecules in the biological sample and 
then exposing the immobilized target to specific antibody which may or may not be labelled 
with a reporter molecule. Depending on the amount of target and the strength of the reporter 
molecule signal, a bound target may be detectable by direct labelling with the antibody. 
5 Altematively, a second labelled antibody, specific to the first antibody is exposed to the target- 
first antibody complex to form a target-first antibody-second antibody tertiary complex. The 
complex is detected by the signal emitted by the reporter molecule. 

By "reporter molecule" as used in the present specification, is meant a molecule which, by its 

10 chemical nature, provides an analytically identifiable signal which allows the detection of 
antigen-bound antibody. Detection may be either qualitative or quantitative. The most 
commonly used reporter molecules in this type of assay are either enzymes, fluorophores or 
radionuclide containing molecules (i.e. radioisotopes) and chemiluminescent molecules. 
In the case of an enzyme immunoassay, an enzyme is conjugated to the second antibody, 

15 generally by means of glutaraldehyde or periodate. As will be readily recognized, however, 
a wide variety of different conjugation techniques exist, which are readily available to the 
skilled artisan. Commonly used enzymes include horseradish peroxidase, glucose oxidase, 
beta-galactosidase and alkaline phosphatase, amongst others. The substrates to be used with 
the specific enzymes are generally chosen for the production, upon hydrolysis by the 

20 corresponding enzyme, of a detectable colour change. Examples of suitable enzymes include 
alkaline phosphatase and peroxidase. It is also possible to employ fluorogenic substrates, 
which yield a fluorescent product rather than the chromogenic substrates noted above. In all 
cases, the enzyme-labelled antibody is added to the fu-st antibody hapten complex, allowed to 
bind, and then the excess reagent is washed away. A solution containing the appropriate 

25 substrate is then added to the complex of antibody- antigen-antibody. The substrate will react 
with the enzyme linked to the second antibody, giving a qualitative visual signal, which may 
be further quantitated, usually spectrophotometrically, to give an indication of the amount of 
hapten which was present in the sample. "Reporter molecule" also extends to use of cell 
agglutination or inhibition of agglutination such as red blood cells on latex beads, and the like. 
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Alternately, fluorescent compounds, such as fluorescein and rhodamine, may be chemically 
coupled to antibodies without altering their binding capacity. When activated by illumination 
with light of a particular wavelength, the fluorochrome-labelled antibody adsorbs the light 
energy, inducing a state to excitability in the molecule, followed by emission of the light at 
5 a characteristic colour visually detectable with a light microscope. As in the EIA, the 
fluorescent laBelled antibody is allowed to bind to the first antibody-hapten complex. After 
washing off the unbound reagent, the remaining tertiary complex is then exposed to the light 
of the appropriate wavelength the fluorescence observed indicates the presence of the hapten 
of interest. Immunofluorescene and EIA techniques are both very well established in the art 
10 and are particularly preferred for the present method. However, other reporter molecules, such 
as radioisotope, chemiluminescent or bioluminescent molecules, may also be employed. 

The present invention also contemplates genetic assays such as involving PCR analysis to 
detect SOCS gene or its derivatives. Alternative methods or methods used in conjunction 
15 include direct nucleotide sequencing or mutation scanning such as single stranded 
conformation polymorphisms analysis (SSCP) as specific oligonucleotide hybridisation, as 
methods such as direct protein truncation tests. 

The nucleic acid molecules of the present invention may be DNA or RNA. When the nucleic 
20 acid molecule is in DNA form, it may be genomic DNA or cDNA. RNA forms of the nucleic 
acid molecules of the present invention are generally mRNA. 

Although the nucleic acid molecules of the present invention are generally in isolated form, 
they may be integrated into or ligated to or otherwise fused or associated with other genetic 
25 molecules such as vector molecules and in particular expression vector molecules. Vectors 
and expression vectors are generally capable of replication and, if applicable, expression in 
one or both of a prokaryotic cell or a eukaryotic cell. Preferably, prokaryotic cells include E. 
coli, Bacillus sp and Pseudomonas sp. Preferred eukaryotic cells include yeast, fungal, 
mammalian and insect cells. 
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Accordingly, another aspect of the present invention contemplates a genetic construct 
comprising a vector portion and a mammalian and more particularly a human SOCS gene 
portion, which SOCS gene portion is capable of encoding a SOCS polypeptide or a functional 
or immunologically interactive derivative thereof. 

5 

Preferably, thi SOCS gene portion of the genetic construct is operably linked to a promoter 
on the vector such that said promoter is capable of directing expression of said SOCS gene 
portion in an appropriate cell. 

10 In addition, the SOCS gene portion of the genetic construct may comprise all or part of the 
gene fused to another genetic sequence such as a nucleotide sequence encoding glutathione-S- 
transferase or part thereof. 

The present invention extends to such genetic constructs and to prokaryotic or eukaryotic cells 
1 5 comprising same. 

The present invention also extends to any or all derivatives of SOCS including mutants, part, 
fragments, portions, homologues and analogues or their encoding genetic sequence including 
single or multiple nucleotide or amino acid substitutions, additions and/or deletions to the 
20 naturally occurring nucleotide or amino acid sequence. The present invention also extends to 
mimetics and agonists and antagonists of SOCS. 

The SOCS and its genetic sequence of the present invention will be useful in the generation 
of a range of therapeutic and diagnostic reagents and will be especially useful in the detection 
25 of a cytokine involved in a particular cellular response or a receptor for that cytokine. For 
example, cells expressing SOCS gene such as Ml cells expressing the SOCSl gene, will no 
longer be responsive to a particular cytokine such as in the case of SOCS 1 , IL-6. Clearly, the 
present invention further contemplates cells such as Ml cells expressing the SOCS2 or S0CS3 
gene. 

30 
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Soluble SOCS polypeptides are also contemplated to be particularly useful in the treatment 
of disease, injury or abnormality involving cytokine mediated cellular responsiveness such as 
hyperimmunity, immunosuppression, allergies, hypertension and the like. 

5 A further aspect of the present invention contemplates the use of SOCS or its functional 
derivatives in^the manufacture of a medicament for the treatment of conditions involving 
cytokine mediated cellular responsiveness. 

The present invention further contemplates transgenic mammalian cells expressing a SOCS 
10 gene. Such cells are useful indicator cell lines for assaying for suppression of cytokine 
function. One example is Ml cells expressing the SOCSl gene. Such cell lines may be useful 
for screening for cytokines or screening molecules such as naturally occurring molecules from 
plants, coral, microorganisms or bio-organically active soil or water capable of acting as 
cytokine antagonists or agonists. 

15 

The present invention further contemplates hybrids between different SOCS from the same 
or different animal species. For example, a hybrid may be formed between all or a functional 
part of mouse SOCSl and human SOCSl. Alternatively, the hybrid may be between all or 
part of mouse SOCSl and mouse SOCS2. All such hybrids are contemplated herein and are 
20 particularly useful in developing pleiotropic molecules. 

The present invention further contemplates a range of genetic based diagnostic assays 
screening for individuals with defective SOCS genes. Such mutations may result in cell types 
not being responsive to a particular cytokine or resulting in over responsiveness leading to a 
25 range of conditions. The SOCS genetic sequence can be readily verified using a range of PCR 
or other techniques to determine whether a mutation is resident in the gene. Appropriate gene 
therapy or other interventionist therapy may then be adopted. 

The present invention is further described by the following non-limiting Figures and 
30 Examples. 
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In the Figures: 

Figure 1 is a diagrammatic representation showing generation of an IL-6-unresponsive Ml 
clone by retroviral infection. The RUFneo retrovirus, showing the position of landmark 
5 restriction endonuclease cleavage sites, the 4A2 cDNA insert and the position of PGR primer 
sequences. ^ 

Figure 2 is a photographic representation of Southern and Northern analysis. (Left and 
Middle Panels) Southern blot analysis of genomic DNA from clone 4A2 and a control 

10 infected Ml clone. DNA was digested with BamH I, to reveal the number of retroviruses 
carried by each clone, and Sac I, to estimate the size of the retroviral cDNA insert. Left 
panel; probed with neo. Right panel; probed with the Xho I-digested 4A2 PGR product. 
(Right Panel) . Northern blot analysis of total RNA from clone 4A2 and a control infected 
Ml clone, probed with the Xho I-digested 4A2 PGR product. The two bands represent 

15 unspliced and spliced retroviral transcripts, resulting from splice donor and acceptor sites 
in the retroviral genome. 

Figure 3 is a representation of the nucleotide sequence and structure of the SOGSl gene. 
A. The genomic context of SOGSl in relation to the protamine gene cluster on murine 

20 chromosome 16. The accession number of this locus is MMPRMGNS (direct submission; 
G. Schlueter, 1995) for the mouse and BTPRMTNP2 for the rat (direct submission; G. 
Schlueter, 1996). B. The nucleotide sequence of the SOGSl cDNA and deduced amino acid 
sequence. Gonventional one letter abbreviations are used for the amino acid sequence and 
the asterisk indicates the stop codon. The polyadenylation signal sequence is underlined. 

25 The coding region is shown in uppercase and the untranslated region is shown in lower case. 

Figure 4 is a graphical representation of cell differentiation in the presence of cytokines. 
Semi-solid agar cultures of parental Ml cells (Ml and Ml.mpl) and Ml cells expressing 
SOGSl (4A2 and Ml.mpl. SOGSl), were used and the percentage of colonies which 
30 differentiated in response to a titration of 1 mg/ml IL-6 ( ), 100 ng/ml LIP (0), 1 mg/ml 
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OSM (□), 100 ng/ml IFN-y (^), 500 ng/ml TPO (•), or 3x10"^ M dexamethasone (*) 
determined. 

Figure 5 is a photographic representation of cytospins of liquid cultures of parental Ml cells 
5 (Ml and Ml.mpl) ) and Ml cells expressing SOCSl (4A2 and Ml.mpl.SOCSl) cultured for 
4 days in the presence of 10 ng/ml IL-6 or saline. Unlike parental Ml cells, morphological 
features consistent with macrophage differentiation are not observed in Ml cells 
constitutively expressing SOCSl (4A2 and Ml.mpl.SOCSl) when cultured in IL-6. 

10 Figure 6 is a photographic representation showing inhibition of phosphorylation of signalling 
molecules by SOCSl. Parental Ml cells (Ml and Ml.mpl) and Ml cells expressmg SOCSl 
(4A2 and Ml.mpl.SOCSl) were incubated in the absence (-) or presence (+) of 10 ng/ml 
of IL-6 for 4 minutes at 37**C . Cells were then lysed and extracts were either 
immunopreciptated using anti-mouse gpl30 antibody prior to SDS-PAGE (two upper panels) 

15 or were elecectrophoresed directly (two lower panels). Gels were blotted and the filters 
were then probed with anti-phosphotyrosine (upper panel), anti-gpl30 antibody (second top 
panel), anti-phospho-STAT3 (second bottom panel) or anti-STAT3 (lower panel). Blots 
were visualised using, peroxidase-conjugated secondary antibodies and Enhanced 
Chemiluminescence (ECL) reagents. 

20 

Figure 7 is a representation of protein extracts prepared from (A) Ml cells or Ml cells 
expressing SOCSl (4A2) and (B) Ml.mpl cells or Ml.mpl.SOCSl cells mcubated for 10 
min at37'*Cinl0ml serum-free DME containing either saline, 100 ng/ml IL-6 or 100 
ng/ml IFN-y. The binding reactions contained 4-6 fjLg protein (constant within a given 
25 experiment), 5 ng ^^P-labelled m67 oligonucleotide encoding the high affinity SIP (c-sis- 
inducible factor) binding site, and 800 ng sonicated salmon sperm DNA. For certain 
experiments, protein samples were preincubated with an excess of unlabelled m67 
oligonucleotide, or antibodies specific for either STATl or STAT3. 

30 Figure 8 is a photographic representation of Northern hybridisation. Mice were injected 



P:\OPER\EJH\SOCS-n.PRV - 14/2/97 



-30- 

intravenoxisly with 2 fxg and after various periods of time, the livers were removed and 
polyA+ mRNA was purified. Ml cells were stimulated for various lengths of time with 
500 ng/ml of IL-6, after which polyA+ mRNA was isolated, mRNA was fractionated by 
electrophoresis and immobilized on nylon filters. Northern blots were prehybridised, 
5 hybridized with random-primed ^^P-labelled SOCSl or GAPDH DNA fragments, washed 
and exposed tb film overnight. 

Figure 9 is a representation of a comparison of the amino acid sequences of SOCSl, 
SOCS2, SOCS3 and CIS. Alignment of the predicted amino acid sequence of mouse (mm), 
10 human (hs) and rat (rr) SOCSl, SOCS2, SOCS3 and CIS. Those residues shaded are 
conserved in three or four mouse SOCS family members. The SH2 domain is boxed in solid 
lines, while the SOCS box is bounded by double lines. 

Figure 10 is a photographic representation showing the phenotype of IL-6 unresponsive Ml 
15 cell clone, 4A2. Colonies of parental Ml cells (left panel) and clone 4A2 (right panel) 
cultured in semi-solid agar for 7 days in saline or 100 ng/ml IL-6. 

Figure 11 is a photographic representation showing expression of mRNA for SOCS family 
members in vitro and in vivo. 
20 (A) Northern analysis of mRNA from a range of mouse organs showing constitutive 
expression of SOCS family members in a limited number of tissues. 

(B) Norther analysis of mRNA from liver and Ml cells showing induction of expression 
of SOCS family members following exposure to IL-6. 

(C) Reverse transcriptase PCR analysis of mRNA from bone marrow showing induction 
25 of expression of SOCS family members by a range of cytokines. 

Rgure 12 is a photographic representation showing SOCSl suppresses the phosphorylation 
and activation of gpl30 and STAT-3. 

(A) Western blots of extracts from parental Ml cells (Ml and Ml.mpl) and Ml cells 
30 expressing SOCSl (4A2 and Ml.mpl.SOCSl) stimulated with (+) or without (-) 100 ng/ml 
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IL-6. Top: Extracts immunoprecipitated with antu-gplSO (agpl30) and immunoblotted with 
anti-phosphotyrosine (aPY-STAT3), or for STAT3 (aSTAT3) to demonstrate equal loading 
of protein. The molecular weights of the bands are shown on the right. 
(B) EMSA of Ml.mpl and Ml.mpl.SOCSl cells stimulated with (+) and without (-) 100 
5 ng/ml IL-6 or 100 ng/ml IFNy. The DNA-binding complexes SIF A, B, and C are 
indicated at th^ left. 
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SUMMARY OF SEQUENCE mENTITY NUMBERS 



SEQUENCE SEQ ID NO. 

PGR Primer 1 

PGR Primer 2 

Mouse SOCSl (nucleotide) 3 

' Mouse SOCSl (amino acid) 4 

Mouse SOCS2 (nucleotide) 5 

Mouse SOCS2 (amino acid) 6 

Mouse SOCS3 (nucleotide) 7 

Mouse SOCS3 (amino acid) 8 

Human SOCS 1 (nucleotide) 9 

Human SOCSl (amino acid) 10 

Rat SOCS 1 (nucleotide) 1 1 

Rat SOCSl (amino acid) 12 



f 
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EXAMPLE 1 
CELL CULTURE AND CYTOKINES 

The Ml cell line was derived from a spontaneously arising lexikaemia in SL mice [Ichikawa, 
1969]. Parental Ml cells used in this study have been in passage at the Walter and Eliza 
5 Hall Institute for Medical Research, Melbourne, Victoria, Australia, for approximately 10 
years. Ml cells were maintained by weekly passage in Dulbecco*s modified Eagle's medium 
(DME) containing 10% (v/v) foetal bovine serum (PCS). Recombinant cytokines are 
generally available from commercial sources or were prepared by published methods. 
Recombinant murine LIF was produced in Escherichia coli and purified, as previously 
10 described [Gearing, 1989], Purified human oncostatin M was purchased from PeproTech 
Inc (Rocky Hill, NJ, USA), and purified mouse IFN-y was obtained from Genzyme 
Diagnostics (Cambridge, MA, USA). Recombinant murine thrombopoietin was produced 
as a FLAGTM-tagged fusion protein in CHO cells and then pxirified. 

15 EXAMPLE 2 

AGAR COLONY ASSAYS 
In order to assay the differentiation of Ml cells in response to cytokines, 300 cells were 
cultured in 35 mm Petri dishes containing 1 ml of DME supplemented with 20% (v/v) fital 
calf serum (FCS), 0.3%(w/v) agar and 0.1 ml of serial dilutions of IL-6, LIF, GSM, IFN-y, 

20 tpo or dexamethasone (Sigma Chemical Company, St Louis, MI). After 7 days culture at 
ST^C m a fully humidified atmosphere, containing 10% (v/v) CO2 in air, colonies of Ml 
cells were counted and classified as differentiated if they were composed of dispersed cells 
or had a corona of dispersed cells around a tightly packed centre. 

25 EXAMPLE 3 

GENERATION OF RETROVIRAL LIBRARY 

A cDNA expression library was constructed from the factor-dependent haemopoietic cell line 
FDC-Pl, essentially as described [Rayner, 1994], Briefly, cDNA was cloned into the 
retroviral vector pRUFneo and then transfected into an amphotrophic packaging cell line 
30 (PA317). Transiently generated virus was harvested from the cell supernatant at 48 hr 
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posttransfection, and used to infect Y2 ecotropic packaging cells, to generate a high titre 
virus-producing cell line. 

EXAMPLE 4 

5 RETROVIRAL DICTION OF Ml CELLS 

Pools of 10^ infected T2 cells were irradiated (3000 rad) and cocultivated with 10^ Ml cells 
in DME supplemented with 10%(v/v) FCS and 4 /xg/ml Polybrene, for 2 days at 37°C. To 
select for IL-6-unresponsive clones, retrovirally- infected Ml cells were washed once in 
DME, and cultured at approximately 2x10^ cells/ml in 1 ml agar cultures containing 400 
10 fig/ml geneticin (GibcoBRL, Grand Island, NY) and 100 ng/ml IL-6. The efficiency of 
infection of Ml cells was 1-2%, as estimated by agar plating the infected cells in the 
presence of geneticin only. 

EXAMPLE 5 

15 PCR 

Genomic DNA from retrovirally-infected Ml cells was digested with Sac I and 1 fxg of 
phenol/chloroform extracted DNA was then amplified by polymerase chain reaction (PCR). 
Primers used for amplification of cDNA inserts from the integrated retrovirus were GAG3 
(5* CACGCCGCCCACGTGAAGGC 3' [SEQ ID NO: I]), which corresponds to the vector 

20 gag sequence approxunately 30 bp 5' of the multiple cloning site, and HSVTK (5* 
TTCGCCAATGACAAGACGCT 3' [SEQ ID NO:2]), which corresponds to the pMClneo 
sequence approximately 200 bp 3' of the multiple cloning site. The PCR entailed an initial 
denaturation at 94**C for 5 mm, 35 cycles of denaturation at 94**C for 1 min, annealing at 
SG'^C for 2 min, and extension at 72°C for 3 min, followed by a fmal 10 min extension. 

25 PCR products were gel purified and then ligated into the pGEM-T plasmid (Promega, 
Madison, WI), and sequenced using an ABI PRISM Dye Terminator Cycle Sequencing Kit 
and a Model 373 Automated DNA Sequencer (Applied Biosystems Inc., Foster City, CA). 



f 
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EXAMPLE6 
CLONING OF cDNAs 

Independent cDNA clones encoding mouse SOCSl were isolated from a murine thymus 
cDNA library essentially as described (Hilton et al, 1994). The nucleotide and predicted 
5 amino acid sequences of mouse SOCSl cDNA were compared to databases using the 
BLASTN and'TFASTA algorithms (Pearson and Lipman, 1988; Pearson, 1990; Altshcul 
etal, 1990). Oligonucleotides were designed from the ESTs encoding human SOCSl and 
mouse SOC-1 and SOCS3 and used to probe commercially available mouse thymus and 
spleen cDNA libraries. Sequencing was performed using an ABI automated sequencer 
10 according to the manufacturer's instructions. 

EXAMPLE 7 

SOUTHERN AND NORTHERN BLOT ANALYSES AND RT-PCR 

^^P-labelled probes were generated using a random decanucleotide labelling kit (Bresatec, 
15 Adelaide, South Australia) from a 600 bp Pst I fragment encoding neomycin 
phophotransfease from the plasmid pPGKneo, 1070 bp fragment of the SOCSl gene 
obtained by digestion of the 1.4 kbp PCR product with Xho I, S0CS2, SOCS3, CIS and a 
1.2 kbp fragment of the chicken glyceraldehyde 3-phosphate dehydrogenase gene 
[Dugaiczyk, 1983]. 

20 

Genomic DNA was isolated from cells using a proteinase K-sodium dodecyl sulfate 
procedure essentially as described. Fifteen micrograms of DNA was digested with either 
BamH I or Sac I, fractionated on a 0.8%(w/v) agarose gel, transferred to GeneScreenPlus 
membrane (Du Pont NEN, Boston MA), prehybridised, hybridised with random-primed ^^P- 
25 labelled DNA fragments and washed essentially as described [Sambrook, 1989], 

Total RNA was isolated from cells and tissues using Trizol Reagent, as recommended by the 
manufacturer (GibcoBRL, Grand Island, NY). When required polyA+ mRNA was purified 
essentially as described [Alexander, 1995]. Northern blots were prehybridised, hybridized 
30 with random-primed 32P-labelIed DNA fragments and washed as described [Alexander, 



P:\OPER\EJH\SOCS.ILPRV - 14/2/97 



-36- 

1995]. 

To 8SSCSS the induction of SOCS gcn&s by IIy-6, mice (C57BL6) were injected intravenously 
with 5 fxg IL-6 followed by harvest of the liver at the indicated timepoints after injection. 
5 Ml cells were cultured in the presence of 20 ng/ml IL-6 and harvested at the indicated 
times. For RT-PCR analysis, bone marrow cells were harvested as described (Metacalf et 
al, 1995) and stimulated for 1 hr at ST'^C with 100 ng/ml of a range of cytokines. RT-PCT 
was performed on total RNA as described (Metcalf et al, 1995). PGR products were 
resolved on an agarose gel and Southern blots were hybridised with probes specific for each 
10 SOCS family member. Expression of P-actin was assessed to ensure uniformity of 
amplification, 

EXAMPLE 8 
DNA CONSTRUCTS AND TRANSFECTION 

15 A cDNA encoding epitope-tagged SOCSl was generated by subcloning the entire SOCSl 
coding region into the pEF-BOS e?q)ression vector [Mizushima, 1990], engineered to encode 
an inframe FLAG epitope downstream of an initiation methionine (pF-SOCSl). Using 
electroporation as described previously [Hilton, 1994], Ml cells expressing the 
thrombopoietin receptor (Ml.mpl) were transfected with the 20 pig of Aat Il-digested pF- 

20 SOCSl expression plasmid and 2 /zg of a Sea I-digested plasmid in which transcription of 
a cDNA encoding puromycin N-acetyl transferase was driven from the mouse 
phosphoglycerokinase promoter (pPGKPuropA). After 48 hours in culture, transfected cells 
were selected with 20 /xg/ml puromycin (Sigma Chemical Company, St Louis MO), and 
screened for expression of SOCSl by Western blottmg, usmg the M2 anti-FLAG 

25 monoclonal antibody according to the manafacturer's instructions (Eastman Kodak, 
Rochester NY). In otiier experiments Ml cells were transfected with only the pF-SOCSl 
plasmid or a control and selected by their ability to grow in agar in the presence of 100 
ng/ml of IL-6. 

30 
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IMMUNOPRECIPITATION AND WESTERN BLOTTING 

Prior to either immunoprecipitaion or Western blotting, 10'' Ml ceils or their derivatives 
were washed twice, resuspended in 1 ml of DME, and incubated at 37°C for 30 min. The 
5 cells were then stimulated for 4 min at 37°C with either saline or 100 ng/ml IL-6, after 
which sodium vanadate (Sigma Chemical Co., St Louis, MI) was added to a concentration 
of 1 mM. Cells were placed on ice, washed once with saline containing 1 mM sodium 
vanadate, and then solubilised for 5 min on ice with 300 ptl 1 % (v/v) Triton X-100, 150 mM 
NaCl, 2 mM EDTA, 50 mM Tris-HCl pH 7.4, containing Complete protease inhibitors 
10 (Boehringer Mannheim, Mannheim, Germany) and 1 mM sodium vanadate. Lysates were 
cleared by centrifiigation and quantitated using a Coomassie Protein Assay Reagent (Pierce, 
Rockford IL). 

For immunoprecipitations, equal concentrations of protein extracts (1-2 mg) were incubated 
15 for 1 hr or overnight at 4°C with either 4 fig of anti-gpl30 antibody (M20; Santa Cruz 
Biotechnology Inc., Santa Cruz, CA) or 4 fig of anti-phosphotyrosine antibody (4G10; 
Upstate Biotechnology Inc., Lake Placid NY), and 15 fxl packed volume of Protein G 
Sepharose (Pharmacia, Uppsala, Sweden) [Hilton et al, 1996]. Immunoprecipitates were 
washed twice in 1 % (v/v) NP40, 150 mM NaCl , 50 mM Tris-HCl pH 8.0, contammg 
20 Complete protease inhibitors (Boehringer Mannheim, Mannheim, Germany and 1 mM 
sodium vanadate. The samples were heated for 5 min at 95 °C in SDS sample buffer (625 
mM Tris-HCl pH 6.8, 0.05% (w/v) SDS, 0.1 % (v/v) glycerol, bromophenol blue, 0.125% 
(v/v) 2-mercaptoethanol), fractionated by SDS-PAGE and immunoblotted as described 
above. 

25 

For Western blottmg, 10 /zg of protein from a cellular extract or material from an 
immunoprecipitation reaction was loaded onto 4-15% Ready gels (Bio-Rad Laboratories, 
Hercules CA), and resolved by sodium dodecyl sulfate polyacrylamide gel electrophoresis 
(SDS-PAGE). Proteins were transferred to PVDF membrane (Micron Separations Inc., 
30 Westborough MA) for 1 hr at 100 V. The membranes were probed with the following 
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primary antibodies; anti-tyrosine phosphorylated STAT3 (1:1000 dilution; New England 
Biolabs, Beverly, MA); anti-STAT3 (C-20; 1:100 dilution; Santa Cruz Biotechnology Inc., 
Santa Cruz CA); anti-gpl30 (M20, 1:100 dilution; Santa Cruz Biotechnology Inc., Santa 
Cruz CA); anti-phosphotyrosine (horseradish peroxidase-conjugated RC20, 1:5000 dilution; 
5 Transduction Laboratories, Lexington KY); anti-tyrosine phosphorylated MAP kinase and 
anti-MAP kinase antibodies (1:1000 dilution; New England Biolabs, Beverly, MA). Blots 
were visualised using peroxidase-conjugated secondary antibodies and Enhanced 
Chemiluminescence (ECL) reagents according to the manafacturer's instructions (Pierce, 
Rockford IL). 

10 

EXAMPLE 10 
ELECTROPHORETIC MOBILITY SHIFT ASSAYS 
Assays were performed as described [Novak, 1995], using the high affinity SIF (c-sis- 
inducible factor) binding site m67 [Wakao, 1994]. Protein extracts were prepared from Ml 

15 cells incubated for 4-10 min at 37°C in 10 ml serum-free DME containing either saline, 100 
ng/ml IL-6 or 100 ng/ml IFN-y. The binding reactions contained 4-6 jxg protein (constant 
withm a given experiment), 5 ng ^^P-labelled m67 oligonucleotide, and 800 ng sonicated 
salmon sperm DNA. For certain experiments, protein samples were preincubated with an 
excess of unlabelled rii67 oligonucleotide, or antibodies specific for either STATl 

20 (Transduction Laboratories, Lexington, KY) or STAT3 (Santa Cruz Biotechnology Inc., 
Santa Cruz CA), as described [Novak, 1995]. 

Western blots were performed usmg anti-tyrosine phosphorylated STAT3 or anti-STAT3 
(New England Biolabs, Beverly, MA) or anit-gpl30 (Santa Cruz Biotechnology Inc.) as 
25 described (Nicola et al, 1996). EMS A were performed using the m67 oligonucleotide 
probe, as described (Novak et al, 1995). 
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EX AMPLE 11 

EXPRESSION CLONING OF A NOVEL SUPPRESSOR OF 
CYTOKINE SIGNAL TRANSDUCTION 

In order to identify cDNAs capable of suppressing cytokine signal transduction, an 
5 expression cloning approach was adopted. This strategy centred on Ml cells, a monocytic 
leukaemia cell line that differentiates into mature macrophages and ceases proliferation in 
response to the cytokines IL-6, LIF, GSM and IFN-y, and the steroid dexamethasone. 
Parental Ml cells were infected with the RUFneo retrovirus, into which cDNAs from the 
factor-dependent haemopoietic cell line FDC-Pl had been cloned. In this retrovirus, 

10 transcription of both the neomycin resistance gene and the cloned cDNA was driven off the 
powerful constitutive promoter present in the retroviral LTR (Figure 1). When cultured in 
semi-solid agar, parental Ml cells form large tightly packed colonies. Upon stimulation 
with IL-6, Ml cells undergo rapid differentiation, resulting in the formation in agar of only 
single macrophages or small dispersed clusters of cells . Retrovirally- infected Ml cells that 

15 were unresponsive to IL-6 were selected in semi-solid agar culture by their ability to form 
large, tightly packed colonies in the presence of IL-6 and geneticin. A single stable IL-6- 
unresponsive clone, 4A2, was obtained after examming 10^ infected cells. 

A fragment of the neom'ycin phosphotransferase (neo) gene was used to probe a Southern 
20 blot of genomic DNA from clone 4A2 and this revealed that the cell line was infected with 
a single retrovirus containing a cDNA approximately 1.4 kbp in length (Figure 2). PGR 
amplification using primers from the retroviral vector which flanked the cDNA cloning site 
enabled recovery of a 1.4 kbp cDNA insert, which we have named suppressor of cytokine 
signalling- 1, or SOCSl. This PGR product was used to probe a sunilar Southern blot of 
25 4A2 genomic DNA and hybridised to two fragments, one which corresponded to the 
endogenous SOCSl gene and the other, which matched the size of the band seen using the 
neo probe, corresponded to the SOCSl cDNA cloned into the integrated retrovirus (Figure 
2). The latter was not observed in an Ml cell clone infected with a retrovirus containing an 
irrelevant cDNA. Similarly, Northern blot analysis revealed that SOCSl mRNA was 
30 abundant in tEe cell Ime 4A2tT)ut not'in the control infected Ml cell clone (Figure 2). 
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EXAMPLE 12 

SOCSl, SOCS2, SOCS3 AND CIS DEFINE A NEW FAMILY 
OF SH2-CONTAINING PROTEINS 

The SOCSl PCR product was used as a probe to isolate homologous cDNAs from a mouse 
5 thymus cDNA library. The sequence of the cDNAs proved to be identical to the PCR 
product, suggesting that constitutive or over ejq^ression, rather than mutation, of the SOCSl 
protein was sufficient for generating an IL-6-unresponsive phenotype. Comparison of the 
sequence of SOCSl cDNA with nucleotide sequence databases revealed that it was present 
on mouse and rat genomic DNA clones containing the protamine gene cluster found on 

10 mouse chromosome 16. Closer inspection revealed that the 1.4 kb SOCSl sequence was not 
homologous to any of the protamine genes, but rather represented a previously unidentified 
open reading frame located at the extreme 3' end of these clones (Figure 3). There were no 
regions of discontinuity between the sequences of the SOCSl cDNA and genomic locus, 
suggesting that SOCSl is encoded by a single exon. In addition to the genomic clone 

15 containing the protamine genes, a series of murine and human expressed sequenced tags 
(ESTs) also revealed large blocks of nucleotide sequence identity to mouse SOCSl. The 
sequence information provided by the human ESTs allowed the rapid cloning of cDNAs 
encoding human SOCSl. 

20 The mouse and rat SOCSl gene encodes a 212 amino acid protein whereas the human 
SOCSl gene encodes a 211 amino acid protein. Mouse, rat and human SOCSl proteins 
share 95-99% amino acid identity (Figure 9). A search of translated nucleic acid databases 
with the predicted amino acid sequence of SOCSl showed that it was most related to a 
recently cloned cytokine-inducible immediate early gene product, CIS, and two classes of 

25 ESTs. Full length cDNAs from the two classes of ESTs were isolated and found to encode 
proteins of similar length and overall structure to SOCS 1 and CIS. These clones were given 
the names S0CS2 and SOCS3. Each of the four proteins contains a central SH2 domain and 
a C-terminal region termed the SOCS motif. The SOCSl protems exhibit an extremely high 
level of amino acid sequence similarity (95-99% identity) amongst different species. 

30 However, the 'forms of the s5CSr, ^0CS2, S0CS3 and CIS from the same animal, while 
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clearly defining a new family of SH2-containing proteins, exhibited a lower amino acid 
identity. S0CS2 and CIS exhibit approximately 38% amino acid identity, while the 
remaining members of the family share approximately 25% amino acid identity (Figure 9). 
The coding region of the genes for SOCSl and SOC3 appear to contain no introns while the 
5 coding region of the genes for SOCS2 and CIS contain one and two introns, respectively. 

The Genbank Acession Numbers for the sequences referred to herein are mouse SOCSl 
cDNA (U88325), human SOCSl cDNA (U88326), mouse SOCS2 cDNA (U88327), mouse 
SOCS3 cDNA (U88328). 

10 

EXAMPLE 13 

CONSTITUTIVE EXPRESSION OF SOCSl SUPPRESSES THE 
ACTION OF A RANGE OF CYTOKINES 

To formally establish that the phenotype of the 4A2 cell line was directly related to 
15 expression of SOCSl, and not to unrelated genetic changes which may have occurred 
independently in these cells, a cDNA encoding an epitope-tagged version of SOCSl under 
the control of the EFlcx promoter was transfected into parental Ml cells, and Ml cells 
expressing the receptor for thrombopoietin, c-mpl (Ml.mpl). Transfection of the SOCSl 
expression vector into both cell lines resulted in an increase in the frequency of IL-6 
20 unresponsive Ml cells. 

Multiple independent clones of Ml cells expression SOCSl, as detected by Western blot, 
displayed a cytokine-unresponsive phenotype that was mdistinguishable from 4A2. Further, 
if transfectants were not maintained in puromycin, expression of SOCSl was lost over time 
25 and cells regained their cytokine responsiveness. In the absence of cytokine, colonies 
derived from 4A2 and other SOCSl expressmg clones characteristically grew to a smaller 
size than colones formed by control Ml cells (Figure 10). 

The effect of constitutive SOCSl expression on the response of Ml cells to a range of 
30 cytokines was investigated using the 4A2 cell line and a clone of Ml.mpl cells expressing 
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SOCSl (Ml.mpl.SOCSl). Unlike parental Ml cells and Ml.mpl cells, the two cell lines 
expressing SOCSl continued to proliferate and failed to form differentiated colonies in 
response to either IL-6, LIF, OSM, IFN-y or, in the case of the Ml.mpl.SOCSl cell line, 
thrombopoietin (Figure 4). For both cell lines, however, a normal response to 
5 dexamethasone was observed, suggesting that SOCSl specifically affected cytokine signal 
transduction rather than differentiation per se. Consistent with these data, while parental Ml 
cells and Ml.mpl cells became large and vacuolated in response to IL-6, 4A2 and 
Ml.mpl.SOCSl cells showed no evidence of morphological differentiation in response to 
IL-6 or other cytokines (Figure 5). 

10 

EXAMPLE 14 

SOCSl INHIBITS A RANGE OF IL-6 SIGNAL TRANSDUCTION 
PROCESSES, INCLUDING STAT3 PHOSPHORYLATION 

AND ACTIVATION 

15 Phosphorylation of the cell surface receptor component gpl30, the cytoplasmic tyrosine 
kinase JAKl and the transcription factor STAT3 is thought to play a central role in IL-6 
signal transduction. These events were compared in the parental Ml and Ml.mpl cell lines 
and dieir SOCSl -expressing counterparts. As expected, gplBO was phosphorylated rapidly 
in response to IL-6 in both parental lines, however, this was reduced five- to ten- fold in the 

20 cell lines expressing SOCSl (Figure 6). Likewise, STAT3 phosphorylation was also 
reduced by approximately ten-fold in response to IL-6 in those cell lines expressing SOCSl 
(Figure 6). Consistent with a reduction in STAT3 phosphorylation, activation of specific 
STAT DNA binding complexes, as determined by electrophoretic mobility shift assay, was 
also reduced. Notably, there was a reduction in the formation of SIF-A (containing 

25 STAT3), SIF-B (STAT1/STAT3 heteroduner) and SIF-C (containing STATl), the three 
STAT complexes induced in Ml cells stimulated widi IL-6 (Figure 7). Sunilarly, 
constitutive expression of SOCSl also inhibited IFN-y-stimulated formation of p91 
homodimers (Figure 7). STAT phosphorylation and activation were not the only 
cytoplasmic processes to be effected by SOCSl expression, as the phosphorylation of other 

30 proteins, mcluding she and MAP kinase, was reduced to a similar extent (Figure 7). 
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EXAMPLE 15 

TRANSCRIPTION OF THE SOCSl GENE IS STIMULATED BY IL-6 

IN VITRO AND IN VIVO 

Although SOCSl can mhibit cytokine signal transduction when constitutively expressed in 
5 Ml cells, this does not necessarily indicate that SOCSl normally functions to negatively 
regulate an IL-6 response. In order to investigate this possibility the inventors determined 
whether transcription of the SOCSl gene is regulated in the response of Ml cells to IL-6 
and, because of the critical role IL-6 plays in regulating the acute phase response to injury 
and infection, the response of the liver to intravenous injection of 5 mg IL-6. In the absence 

10 of IL-6, SOCSl mRNA was undetectable in either Ml cells or in the liver. However, for 
both cell types, a 1.4 kb SOCSl transcript was induced within 20 to 40 minutes by IL-6 
(Figure 8). For Ml cells, where the IL-6 was present throughout the experiment, the level 
of SOCSl mRNA remained elevated (Figure 8). In contrast, IL-6 was administered in vivo 
by a single intravenous injection and was rapidly cleared from the circulation, resulting in 

15 a pulse of IL-6 stimulation to the liver. Consistent with this, transient expression of SOCSl 
mRNA was detectable in the liver, peaking approximately 40 minutes after injection and 
declining to basal levels within 4 hours (Figure 8). 

EXAMPLE 16 

20 REGULATION OF SOCS GENES 

Since CIS was cloned as a cytokine-inducible immediate early gene the inventors examined 
whether SOCSl, SOCS2 and SOCS3 were similarly regulated. The basal pattern of 
expression of the four SOCS genes was examined by Northern blot analysis of mRNA from 

25 a variety of tissues from male and female C57B1/6 mice (Figure 11 A). Constitutive 
expression of SOCSl was observed in the thymus and to a lesser extend in the spleen and 
the lung. S0CS2 expression was restricted primarily to the testis and in some animals the 
liver and lung; for SOCS3 a low level of expression was observed in the lung, spleen and 
thymxis, while CIS expression was more widespread, including the testis, heart, lung, kidney 

30 and, in some animals, the liver. 
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The inventors sought to determine whether expression of the four SOCS genes was regulated 
by IL-6. Northern blots of mRNA prepared from the livers of untreated and IL-6-injected 
mice, or from unstimulated and IL-6-stimulated Ml cells, were hybridised with labelled 
fragments of SOCSl, SOCS2, SOCS3 and CIS cDNAs (FiGure IIB). Expression of all 
5 four SOCS genes was increased in the liver following IL-6 injection, however the kinetics 
of induction appeared to differ. Expression of SOCSl and SOCS3 was transient in the liver, 
with mRNA detectable after 20 minutes of IL-6 injection and declining to basal levels within 
4 hours for SOCS and 8 hours for SOCS3. Induction of S0CS2 and CIS mRNA m the liver 
followed similar initial kinetics to that of SOCSl, but was maintained at an elevated level 
10 for at least 24 hours. A similar induction of SOCS gene mRNA was observed in other 
organs, notably die lung and the spleen. In contrast, in Ml cells, while SOCSl and CIS 
mRNA were induced by IL-6, no induction of either SOCS2 or SOCS3 expression was 
detected. This result highlights cell type-specific differences in the expression of the genes 
of SOCS family members in response to the same cytokine. 

15 

In order to examine the spectrum of cytokines that was capable of inducing transcription of 
the various members of the SOCS gene family, bone marrow cells were stimulated for an 
hour with a range of cytokines, after which mRNA was extracted and cDNA was 
synthesised. PCR was then used to assess the expression of SOCSl, S0CS2, SOCS3 and 

20 CIS (Figure IIC). In the absence of stimulation, little or no expression of any of the SOCS 
genes was detectable in bone marrow by PCR. Stunulation of bone marrow cells with a 
broad array of cytokines appeared capable of up regulating mRNA for one or more members 
of the SOCS family, IFNy, for example, induced expression of all four SOCS genes, while 
erythropoietin, granulocyte colony-stimulating factor, granulocyte-macrophage colony 

25 stimulating factor and mterleukin-3 induced expression of SOCS2, SOCS3 and CIS. 
Interestingly, tumor necrosis factor alpha, macrophage colony-stimulating factor and 
interleukin-1, which act through receptors that do not fall into the type I cytokine receptor 
class also appeared capable of inducing expression of SOCS3 and CIS, suggesting that SOCS 
proteins may play a broader role in regulating signal transduction. 
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As constitutive expression of SOCSl inhibited the response of Ml cells to a range of 
cytokines, the inventors examined whether phosphorylation of the cell surface receptor 
component gpl30 and the transcription factor STA3, which are though to play a central role 
in IL-6 signal transduction, were affected. These events were compared in the parental Ml 
5 and Ml.mpl cell lines and their SOCSl -expressing counterparts. As expected, gpl30 was 
phyosphorylafed rapidly in response to IL-6 in both parental lines, however, this was 
reduced in the cell Imes expressing SOCSl (Figure 12A). Likewise, STATS 
phosphorylation was also reduced in response to IL-6 in those cell lines expressing SOCSl 
(Figure 12A). Consistent with a reduction in STAT3 phosphorylation, activation of specific 

10 STA/DNA binding complexes, as determined by electrophoretic mobility shift assay, was 
also reduced. Notably, there was a failure to form SIF-A (containing STAT3) and SIF- 
B(STAT1/STAT3 heterodimer), the major STAT complexes induced in Ml cells stimulated 
with IL-6 (Figure 12B). Similarly, constitutive expression of SOCSl also inhibited IFNy- 
stimulating formation of SIF-C (STATl homodimer; Figure 12B). These experiments are 

1 5 consistent with the proposal that SOCSl inhibits signal transduction upstream of receptor and 
STAT phosphorylation, potentially at the level of the JAK kinases. 

The ability of SOCSl to inhibit signal transduction and ultimately the biological response 
to cytokines suggest that', like the SH2-containing phosphatase SHP-1 (Ihle et al, 1994; Yi 

20 et al, 1993), the SOCS proteins may play a central role in controlling the intensity and/or 
duration of a cell's response to a diverse range of extracellular stimuli by suppressing the 
signal transduction process. The evidence provided here indicates that the SOCS family acts 
in a classical negative feedback loop for cytokine signal transduction. Like other genes such 
as OSM, expression of genes encoding the SOCS proteins is induced by cytokines through 

25 the activation of STATs. Once expressed, it is proposed that the SOCS proteins inhibit the 
activity of JAKs and so reduce the phosphorylation of receptors and STATs, thereby 
suppressing signal transduction and any ensuing biological response. Importantly, inhibition 
of STAT activation will, over time, lead to a reduction in SOCS gene expression, allowmg 
cells to regain responsiveness to cytokines. 
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Those skilled in the art will appreciate that the invention described herein is susceptible to 
variations and modifications other than those specifically described. It is to be understood that 
the invention includes all such variations and modifications. The invention also includes all 
of the steps, features, compositions and compounds referred to or indicated in this 
5 specification, individually or collectively, and any and all combinations of any two or more 
of said steps or features. 
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(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 
(C?) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

' (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



CACGCCGCCC ACGTGAAGGC 2 0 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
TTCGCCAATG ACAAGACGCT 2 0 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1236 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1. .636 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 



CGAGGCTCAA GCTCCGGGCG GATTCTGCGT GCCGCTCTCG CTCCTTGGGG TCTGTTGGCC 



-101 
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GGCCTGTGCC ACCCGGACGC CCGGCTCACT GCCTCTGTCT CCCCCATCAG CGCAGCCCCG -41 
GACGCTATGG CCCACCCCTC CAGCTGGCCC CTCGAGTAGG -1 



ATG GTA GCA CGC AAC CAG GTG GCA GCC GAC AAT GCG ATC TCC CCG GCA 4 8 

Met Val Ala Arg Asn Gin Val Ala Ala Asp Asn Ala lie Ser Pro Ala 
15 10 15 

GCA GAG CCC -bCGA CGG CGG TCA GAG CCC TCC TCG TCC TCG TCT TCG TCC 96 
Ala Glu Pro Arg Arg Arg Ser Glu Pro Ser Ser Ser Ser Ser Ser Ser 
20 25 30 

TCG CCA GCG GCC CCC GTG CGT CCC CGG CCC TGC CCG GCG GTC CCA GCC 144 
Ser. Pro Ala Ala Pro Val Arg Pro Arg Pro Cys Pro Ala Val Pro Ala 
35 40 45 

CCA GCC CCT GGC GAC ACT CAC TTC CGC ACC TTC CGC TCC CAC TCC GAT 192 
Pro Ala Pro Gly Asp Thr His Phe Arg Thr Phe Arg Ser His Ser Asp 
50 55 60 

TAC CGG CGC ATC ACG CGG ACC AGC GCG CTC CTG GAC GCC TGC GGC TTC 24 0 

Tyr Arg Arg He Thr Arg Thr Ser Ala Leu Leu Asp Ala Cys Gly Phe 
65 70 75 80 

TAT TGG GGA CCC CTG AGC GTG CAC GGG GCG CAC GAG CGG CTG CGT GCC 2 88 

Tyr Trp Gly Pro Leu Ser Val His Gly Ala His Glu Arg Leu Arg Ala 
85 90 95 

GAG CCC GTG GGC ACC TTC TTG GTG CGC GAC AGT CGT CAA CGG AAC TGC 33 6 

Glu Pro Val Gly Thr Phe Leu Val Arg Asp Ser Arg Gin Arg Asn Cys 
100 105 110 

TTC TTC GCG CTC AGC GTG AAG ATG GCT TCG GGC CCC ACG AGC ATC CGC 3 84 

Phe Phe Ala Leu Ser Val Lys Met Ala Ser Gly Pro Thr Ser He Arg 
115 120 125 

GTG CAC TTC CAG GCC GGC CGC TTC CAC TTG GAC GGC AGC CGC GAG ACC 432 
Val His Phe Gin Ala Gly Arg Phe His Leu Asp Gly Ser Arg Glu Thr 
130 135 140 

TTC GAC TGC CTT TTC GAG CTG CTG GAG CAC TAC GTG GCG GCG CCG CGC 48 0 

Phe Asp Cys Leu Phe Glu Leu Leu Glu His Tyr Val Ala Ala Pro Arg 
145 150 155 160 

CGC ATG TTG GGG GCC CCG CTG CGC CAG CGC CGC GTG CGG CCG CTG CAG 52 8 

Arg Met Leu Gly Ala Pro Leu Arg Gin Arg Arg Val Arg Pro Leu Gin 
165 170 175 

GAG CTG TGT CGC CAG CGC ATC GTG GCC GCC GTG GGT CGC GAG AAC CTG 576 
Glu Leu Cys Arg Gin Arg He Val Ala Ala Val Gly Arg Glu Asn Leu 
180 185 190 



GC(S^ CGC ffrC-CCT CTT AAC "fTCG GTk CTC CGT GAC TAC CTG AGT TCC TTC 624 
Ala Arg He Pro Leu Asn Pro Val Leu Arg Asp Tyr Leu Ser Ser Phe 
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195 200 205 

CCC TTC CAG ATC TGA CCGGCTG CCGCTGTGCC GCAGCATTAA GTGGGGGCGC 6 76 
Pro Phe Gin lie * 
210 

CTTATTATTT CTTATTATTA ATTATTATTA TTTTTCTGGA ACCACGTGGG AGCCCTCCCC 73 6 

GCCTGGGTCG GAGGGAGTGG TTGTGGAGGG TGAGATGCCT CCCACTTCTG GCTGGAGACC 79 6 

TCATCCCACC TCTCAGGGGT GGGGGTGCTC CCCTCCTGGT GCTCCCTCCG GGTCCCCCCT 856 

GGTTGTAGCA GCTTGTGTCT GGGGCCAGGA CCTGAATTCC ACTCCTACCT CTCCATGTTT 916 

ACATATTCCC AGTATCTTTG CACAAACCAG GGGTCGGGGA GGGTCTCTGG CTTCATTTTT 97 6 

CTGCTGTGCA GAATATCCTA TTTTATATTT TTACAGCCAG TTTAGGTAAT AAACTTTATT 103 6 

ATGAAAGTTT TTTTTTAAAA GAAAAAAAAA AAAAAAAAA 1075 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 212 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 



Met Val Ala Arg Asn Gin Val Ala Ala Asp Asn Ala lie Ser Pro Ala 
15 10 15 

Ala Glu Pro Arg Arg Arg Ser Glu Pro Ser Ser Ser Ser Ser Ser Ser 
20 25 30 

Ser Pro Ala Ala Pro Val Arg Pro Arg Pro Cys Pro Ala Val Pro Ala 
35 40 45 

Pro Ala Pro Gly Asp Thr His Phe Arg Thr Phe Arg Ser His Ser Asp 
50 55 60 

Tyr Arg Arg lie Thr Arg Thr Ser Ala Leu Leu Asp Ala Cys Gly Phe 
65 70 75 80 

Tyr Trp Gly Pro Leu Ser Val His Gly Ala His Glu Arg Leu Arg Ala 
85 90 95 

Glu Pro Val Gly Thr Phe Leu Val Arg Asp Ser Arg Gin Arg Asn Cys 

^ * ' *ioo ^ ao5 110 
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Phe Phe Ala Leu Ser Val Lys Met Ala Ser Gly Pro Thr Ser lie Arg 
115 120 125 

Val His Phe Gin Ala Gly Arg Phe His Leu Asp Gly Ser Arg Glu Thr 
130 135 140 

Phe Asp Cys Leu Phe Glu Leu Leu Glu His Tyr Val Ala Ala Pro Arg 
145 150 155 160 

Arg Met Leu X31y Ala Pro Leu Arg Gin Arg Arg Val Arg Pro Leu Gin 
165 170 175 

Glu Leu Cys Arg Gin Arg lie Val Ala Ala Val Gly Arg Glu Asn Leu 
180 185 190 

Ala Arg lie Pro Leu Asn Pro Val Leu Arg Asp Tyr Leu Ser Ser Phe 
195 200 205 

Pro Phe Gin lie 
210 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1121 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 223.. 819 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GCGATCTGTG GGTGACAGTG TCTGCGAGAG ACTTTGCCAC ACCATTCTGC CGGAATTTGG 6 0 

AGAAAAAGAA CCAGCCGCTT CCAGTCCCCT CCCCCTCCGC CACCATTTCG GACACCCTGC 12 0 

ACACTCTCGT TTTGGGGTAC CCTGTGACTT CCAGGCAGCA CGCGAGGTCC ACTGGCCCCA 18 0 

GCTCGGGCGA CCAGCTGTCT GGGACGTGTT GACTCATCTC CC ATG ACC CTG CGG 234 

Met Thr Leu Arg 
1 

TGC CTG GAG CCC TCC GGG AAT GGA GCG GAC AGG ACG CGG AGC CAG TGG 2 82 

Cys Leu Glu Pro Ser Gly Asn Gly Ala Asp Arg Thr Arg Ser Gin Trp 
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GGG ACC GCG GGG TTG CCG GAG GAA CAG TCC CCC GAG GCG GCG CGT CTG 33 0 

Gly Thr Ala Gly Leu Pro Glu Glu Gin Ser Pro Glu Ala Ala Arg Leu 
25 30 35 

GCG AAA GCC CTG CGC GAG CTC AGT CAA ACA GGA TGG TAC TGG GGA AGT 3 78 

Ala Lys Ala Leu Arg Glu Leu Ser Gin Thr Gly Trp Tyr Trp Gly Ser 
40 45 50 

ATG ACT GTT AAT GAA GCC AAA GAG AAA TTA AAA GAG GCT CCA GAA GGA 42 6 

Met Thr Val tAsn Glu Ala Lys Glu Lys Leu Lys Glu Ala Pro Glu Gly 
55 60 65 

ACT TTC TTG ATT AGA GAT AGT TCG CAT TCA GAC TAC CTA CTA ACT ATA 4 74 

Thr Phe Leu lie Arg Asp Ser Ser His Ser Asp Tyr Leu Leu Thr lie 
' 70 75 80 

TCC GTT AAG ACG TCA GCT GGA CCG ACT AAC CTG CGG ATT GAG TAC CAA 522 
Ser Val Lys Thr Ser Ala Gly Pro Thr Asn Leu Arg lie Glu Tyr Gin 
85 90 95 100 

GAT GGG AAA TTC AGA TTG GAT TCT ATC ATA TGT GTC AAG TCC AAG CTT 57 0 

Asp Gly Lys Phe Arg Leu Asp Ser lie lie Cys Val Lys Ser Lys Leu 
105 110 115 

AAA CAG TTT GAC AGT GTG GTT CAT CTG ATT GAC TAC TAT GTC CAG ATG 618 
Lys Gin Phe Asp Ser Val Val Hie Leu lie Asp Tyr Tyr Val Gin Met 
120 125 130 

TGC AAG GAT AAA CGG ACA GGC CCA GAA GCC CCA CGG AAT GGG ACT GTT 666 
Cys Lys Asp Lys Arg Thr Gly Pro Glu Ala Pro Arg Asn Gly Thr Val 
135 140 145 

CAC CTG TAC CTG ACC AAA CCT CTG TAT ACA TCA GCA CCC ACT CTG CAG 714 
His Leu Tyr Leu Thr Lys Pro Leu Tyr Thr Ser Ala Pro Thr Leu Gin 
150 155 160 

CAT TTC TGT CGA CTC GCC ATT AAC AAA TGT ACC GGT ACG ATC TGG GGA 762 
His Phe Cys Arg Leu Ala lie Asn Lys Cys Thr Gly Thr lie Trp Gly 
165 170 175 180 

CTG CCT TTA CCA ACA AGA CTA AAA GAT TAC TTG GAA GAA TAT AAA TTC 810 
Leu Pro Leu Pro Thr Arg Leu Lys Asp Tyr Leu Glu Glu Tyr Lys Phe 
185 190 195 

CAG GTA TAAGTATTTC TCTCTCTTTT TCGTTTTTTT TTAAAAAAAA AAAAACACAT 8 66 

Gin Val 



GCCTCATATA GACTATCTCC GAATGCAGCT ATGTGAAAGA GAACCCAGAG GCCCTCCTCT 92 6 

GGATAACTGC GCAGAATTCT CTCTTAAGGA CAGTTGGGCT CAGTCTAACT TAAAGGTGTG 98 6 

AAGATGTAGC TAGGTATTTT AAAGTTCCCC TTAGGTAGTT TTAGCTGAAT GATGCTTTCT 104 6 

TTCCTATGGC TGCTCAAGAT CAAATGGCCC TTTTAAATGA AACAAAACAA AACAAAACAA 1106 
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AAAAAAAAAA AAAAA 1121 



(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Met'Thr Leu Arg Cys Leu Glu Pro Ser Gly Aen Gly Ala Asp Arg Thr 
15 10 15 

Arg Ser Gin Trp Gly Thr Ala Gly Leu Pro Glu Glu Gin Ser Pro Glu 
20 25 30 

Ala Ala Arg Leu Ala Lys Ala Leu Arg Glu Leu Ser Gin Thr Gly Trp 
35 40 45 

Tyr Trp Gly Ser Met Thr Val Asn Glu Ala Lys Glu Lys Leu Lys Glu 
50 55 60 

Ala Pro Glu Gly Thr Phe Leu lie Arg Asp Ser Ser His Ser Asp Tyr 
65 70 75 80 

Leu Leu Thr lie Ser Val Lys Thr Ser Ala Gly Pro Thr Asn Leu Arg 
85 90 95 

lie Glu Tyr Gin Asp Gly Lys Phe Arg Leu Asp Ser lie lie Cys Val 
100 ' 105 110 

Lys Ser Lys Leu Lys Gin Phe Asp Ser Val Val His Leu lie Asp Tyr 
115 120 125 

Tyr Val Gin Met Cys Lys Asp Lys Arg Thr Gly Pro Glu Ala Pro Arg 
130 135 140 

Asn Gly Thr Val His Leu Tyr Leu Thr Lys Pro Leu Tyr Thr Ser Ala 
145 150 155 160 

Pro thr Leu Gin His Phe Cys Arg Leu Ala lie Asn Lys Cys Thr Gly 
165 170 175 

Thr He Trp Gly Leu Pro Leu Pro Thr Arg Leu Lys Asp Tyr Leu Glu 
180 185 190 

Glu Tyr Lys Phe Gin Val 
195 

(2) INFORMATION FOR SEQ ID NO : 7 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2187 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 18.. 695 



. (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

CGCTGGCTCC GTGCGCC ATG GTC ACC CAC AGC AAG TTT CCC GCC GCC GGG 5 0 

Met Val Thr His Ser Lys Phe Pro Ala Ala Gly 
15 10 

ATG AGC CGC CCC CTG GAC ACC AGC CTG CGC CTC AAG ACC TTC AGC TCC 98 
Met Ser Arg Pro Leu Asp Thr Ser Leu Arg Leu Lys Thr Phe Ser Ser 
15 20 25 

AAA AGC GAG TAC CAG CTG GTG GTG AAC GCC GTG CGC AAG CTG CAG GAG 14 6 

Lys Ser Glu Tyr Gin Leu Val Val Asn Ala Val Arg Lys Leu Gin Glu 
30 35 40 

AGC GGA TTC TAC TGG AGC GCC GTG ACC GGC GGC GAG GCG AAC CTG CTG 194 
Ser Gly Phe Tyr Trp Ser Ala Val Thr Gly Gly Glu Ala Asn Leu Leu 
45 50 55 

CTC AGC GCC GAG CCC GCG GGC ACC TTT CTT ATC CGC GAC AGC TCG GAC 242 
Leu Ser Ala Glu Pro Ala Gly Thr Phe Leu lie Arg Asp Ser Ser Asp 
60 65 70 75 

CAG CGC CAC TTC TTC ACG TTG AGC GTC AAG ACC CAG TCG GGG ACC AAG 29 0 

Gin Arg His Phe Phe Thr Leu Ser Val Lys Thr Gin Ser Gly Thr Lys 
80 85 90 

AAC CTA CGC ATC CAG TGT GAG GGG GGC AGC TTT TCG CTG CAG AGT GAC 33 8 

Asn Leu Arg lie Gin Cys Glu Gly Gly Ser Phe Ser Leu Gin Ser Asp 
95 100 105 

CCC CGA AGC ACG CAG CCA GTT CCC CGC TTC GAC TGT GTA CTC AAG CTG 38 6 

Pro Arg Ser Thr Gin Pro Val Pro Arg Phe Asp Cys Val Leu Lys Leu 
110 115 120 

GTG CAC CAC TAC ATG CCG CCT CCA GGG ACC CCC TCC TTT TCT TTG CCA 434 
Val His His Tyr Met Pro Pro Pro Gly Thr Pro Ser Phe Ser Leu Pro 
125 130 135 

CCC ACG GAA CCC TCG TCC GAA GTT CCG GAG CAG CCA CCT GCC CAG GCA 4 82 

PrS Thr 5lu-Pro Ser Ser "t?lu Val -Pro Glu Gin Pro Pro Ala Gin Ala 
140 145 150 155 
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CTC CCC GGG AGT ACC CCC AAG AGA GCT TAG TAG ATC TAT TCT GGG GGC 53 0 

Leu Pro Gly Ser Thr Pro Lys Arg Ala Tyr Tyr lie Tyr Ser Gly Gly 
160 165 170 



GAG AAG ATT CCG CTG GTA CTG AGC CGA COT CTC TCC TCC AAC GTG GCC 57 8 

Glu Lys lie Pro Leu Val Leu Ser Arg Pro Leu Ser Ser Asn Val Ala 
175 180 185 



ACC CTC CAG CAT CTT TGT CGG AAG ACT GTC AAC GGC CAC CTG GAC TCC 62 6 

Thr Leu Gin t»His Leu Cys Arg Lys Thr Val Asn Gly His Leu Asp Ser 
190 195 200 

TAT GAG AAA GTG ACC CAG CTG CCT GGA CCC ATT CGG GAG TTC CTG GAT 6 74 

Tyr Glu Lys Val Thr Gin Leu Pro Gly Pro lie Arg Glu Phe Leu Asp 
.205 210 215 

CAG TAT GAT GCT CCA CTT TAAGGAGCAA AAGGGTCAGA GGGGGGCCTG 722 
Gin Tyr Asp Ala Pro Leu 
220 225 

GGTCGGTCGG TCGCCTCTCC TCCGAGGCAC ATGGCACAAG CACAAAAATC CAGCCCCAAC 7 82 

GGTCGGTAGC TCCCAGTGAG CCAGGGGCAG ATTGGCTTCT TCCTCAGGCC CTCCACTCCC 842 
GCAGAGTAGA GCTGGCAGGA CCTGGAATTC GTCTGAGGGG AGGGGGAGCT GCCACCTGCT 9 02 

TTCCCCCCTC CCCCAGCTCC AGCTTCTTTC AAGTGGAGCC AGCCGGCCTG GCCTGGTGGG 9 62 

ACAATACCTT TGACAAGCGG ACTCTCCCCT CCCCTTCCTC CACACCCCCT CTGCTTCCCA 102 2 

AGGGAGGTGG GGACACCTCC AAGTGTTGAA CTTAGAACTG CAAGGGGAAT CTTCAAACTT 108 2 

TCCCGCTGGA ACTTGTTTGC GCTTTGATTT GGTTTGATCA AGAGCAGGCA CCTGGGGGAA 114 2 

GGATGGAAGA GAAAAGGGTG TGTGAAGGGT TTTTATGCTG GCCAAAGAAA TAACCACTCC 12 02 

CACTGCCCAA CCTAGGTGAG GAGTGGTGGC TCCTGGCTCT GGGGAGAGTG GCAAGGGGTG 12 62 

ACCTGAAGAG AGCTATACTG GTGCCAGGCT CCTCTCCATG GGGCAGCTAA TGAAACCTCG 1322 

CAGATCCCTT GCACCCCAGA ACCCTCCCCG TTGTGAAGAG GCAGTAGCAT TTAGAAGGGA 1382 

GACAGATGAG GCTGGTGAGC TGGCCGCCTT TTCCAACACC GAAGGGAGGC AGATCAACAG 144 2 

ATGAGCCATC TTGGAGCCCA GGTTTCCCCT GGAGCAGATG GAGGGTTCTG CTTTGTCTCT 15 02 

CCTATGTGGG GCTAGGAGAC TCGCCTTAAA TGCCCTCTGT CCCAGGGATG GGGATTGGCA 15 62 

CACAAGGAGC CAAACACAGC CAATAGGCAG AGAGTTGAGG GATTCACCCA GGTGGCTACA 1622 

GGCCAGGGGA AGTGGCTGCA GGGGAGAGAC CCAGTCACTC CAGGAGACTC CTGAGTTAAC 16 82 

ACTGGGAAGA CATTGGCCAG TCCTAGTCAT CTCTCGGTCA GTAGGTCCGA GAGCTTCCAG 1742 

GCCCTGCACA GCCCTCCTTT CTCACCTGGG GGGAGGCAGG AGGTGATGGA GAAGCCTTCC 1802 
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CATGCCGCTC ACAGGGGCCT CACGGGAATG CAGCAGCCAT GCAATTACCT GGAACTGGTC 18 62 

CTGTGTTGGG GAGAAACAAG TTTTCTGAAG TCAGGTATGG GGCTGGGTGG GGCAGCTGTG 1922 

TGTTGGGGTG GCTTTTTTCT CTCTGTTTTG AATAATGTTT ACAATTTGCC TCAATCACTT 198 2 

TTATAAAAAT CCACCTCCAG CCCGCCCCTC TCCCCACTCA GGCCTTCGAG GCTGTCTGAA 2 04 2 

GATGCTTGAA AAACTCAACC AAATCCCAGT TCAACTCAGA CTTTGCACAT ATATTTATAT 2102 

TTATACTCAG AAAAGAAACA TTTCAGTAAT TTATAATAAA AGAGCACTAT TTTTTAATGA 216 2 

AAAAAAAAAA AAAAAAAAAA AAAAA 2187 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 225 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

Met Val Thr His Ser Lys Phe Pro Ala Ala Gly Met Ser Arg Pro Leu 
1 5 10. 15 

Asp Thr Ser Leu Arg Leu Lys Thr Phe Ser Ser Lys Ser Glu Tyr Gin 
20 25 30 

Leu Val Val Asn Ala Val Arg Lys Leu Gin Glu Ser Gly Phe Tyr Trp 
35 ' 40 45 

Ser Ala Val Thr Gly Gly Glu Ala Asn Leu Leu Leu Ser Ala Glu Pro 
50 55 60 

Ala Gly Thr Phe Leu lie Arg Asp Ser Ser Asp Gin Arg His Phe Phe 
65 70 75 80 

Thr Leu Ser Val Lys Thr Gin Ser Gly Thr Lys Asn Leu Arg lie Gin 
85 90 95 

Cys Glu Gly Gly Ser Phe Ser Leu Gin Ser Asp Pro Arg Ser Thr Gin 
100 105 110 

Pro Val Pro Arg Phe Asp Cys Val Leu Lys Leu Val His His Tyr Met 
115 120 125 

Pro Pro Pro Gly Thr Pro Ser Phe Ser Leu Pro Pro Thr Glu Pro Ser 
130 135 140 

Se? Glu Vai' Pro Glu Gin ^ro P fb -Ala Gin Ala Leu Pro Gly Ser Thr 
145 150 155 160 
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Pro Lys Arg Ala Tyr Tyr lie Tyr 
165 

Val Leu Ser Arg Pro Leu Ser Ser 
180 

Cys Arg Lys Thr Val Asn Gly His 
195 200 

Gin Leu Pro tGly Pro lie Arg Glu 
.210 215 



Ser Gly Gly Glu Lys lie Pro Leu 
170 175 

Asn Val Ala Thr Leu Gin His Leu 
185 190 

Leu Asp Ser Tyr Glu Lys Val Thr 
205 

Phe Leu Asp Gin Tyr Asp Ala Pro 
220 



Leu 
225 



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1094 base pairs 

(B) TYPE: nucleic acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 



CTCCGGCTGG 


CCCCTTCTGT 


AGGATGGTAG 


CACACAACCA 


GGTGGCAGCC 


GACAATGCAG 


60 


TCTCCACAGC 


AGCAGAGCCC 


CGACGGCGGC 


CAGAACCTTC 


CTCCTCTTCC 


TCCTCCTCGC 


120 


CCGCGGCCCC 


CGCGCGCCCG 


CGGCCGTGCC 


CCGCGGTCCC 


GGCCCCGGCC 


CCCGGCGACA 


180 


CGCACTTCCG 


CACATTCCGT 


TCGCACGCCG 


ATTACCGGCG 


CATCACGCGC 


GCCAGCGCGC 


240 


TCCTGGACGC 


CTGCGGATTC 


TACTGGGGGC 


CCCTGAGCGT 


GCACGGGGCG 


CACGAGCGGC 


300 


TGCGCGCCGA 


GCCCGTGGGC 


ACCTTCCTGG 


TGCGCGACAG 


CCGCCAGCGG 


AACTGCTTTT 


360 


TCGCCCTTAG 


CGTGAAGATG 


GCCTCGGGAC 


CCACGAGCAT 


CCGCGTGCAC 


TTTCAGGCCG 


420 


GCCGCTTTCA 


CCTGGATGGC 


AGCCGCGAGA 


GCTTCGACTG 


CCTCTTCGAG 


CTGCTGGAGC 


480 


ACTACGTGGC 


GGCGCCGCGC 


CGCATGCTGG 


GGGCCCCGCT 


GCGCCAGCGC 


CGCGTGCGGC 


540 


CGCTGCAGGA 


GCTGTGCCGC 


CAGCGCATCG 


TGGCCACCGT 


GGGCCGCGAG 


AACCTGGCTC 


600 


GCATCCCCCT 


CAACCCCGTC 


CTCCGCGACT 


ACCTGAGCTC 


CTTCCCCTTC 


CAGATTTGAC 


660 


CGGCAGCGCC 


CGCCGTGCAC 


GCAGCATTAA 


CTGGGATGCC 


GTGTTATTTT 


GTTATTACTT 


720 


GCCTGGAACC 


ATGTGGGTAC 


CCTCCCCGGC 


CTGGGTTGGA 


GGGAGCGGAT 


GGGTGTAGGG 


780 


GCGAGGCGCC 


TCCCGCCCTC 


GGCTGGAGAC 


GAGGCCGCAG 


ACCCCTTCTC 


ACCTCTTGAG 


840 


GGGGTCCTCC 


CCCTCCTGGT 


GCTCCCTCTG 


GGTCCCCCTG 


GTTGTTGTAG 


CAGCTTAACT 


900 


GTgTCTGGAG 


' CCAGGACCTG 


AAbrCGCA^C- 


TCCTACCTCT 


TCATGTTTAC 


ATATACCCAG 


960 


TATCTTTGCA 


CAAACCAGGG 


GTTGGGGGAG 


GGTCTCTGGC 


TTTATTTTTC 


TGCTGTGCAG 


1020 
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AATCCTATTT TATATTTTTT AAAGTCAGTT TAGGTAATAA ACTTTATTAT GAAAGTTTTT 108 0 

TTTTTTAAAA AAAA 10 94 



(2) INFORMATION FOR SEQ ID NO:10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 211 amino acids 

(B) TYPE: amino acid 

( C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Val Ala His Asn Gin Val Ala Ala Asp Asn Ala Val Ser Thr Ala 
15 10 15 

Ala Glu Pro Arg Arg Arg Pro Glu Pro Ser Ser Ser Ser Ser Ser Ser 
20 ' 25 30 

Pro Ala Ala Pro Ala Arg Pro Arg Pro Cys Pro Ala Val Pro Ala Pro 
35 40 45 

Ala Pro Gly Asp Thr His Phe Arg Thr Phe Arg Ser His Ala Asp Tyr 

50 55 60 

Arg Arg lie Thr Arg Ala Ser Ala Leu Leu Asp Ala Cys Gly Phe Tyr 
65 70 75 80 

Trp Gly Pro Leu Ser Val His Gly Ala His Glu Arg Leu Arg Ala Glu 
85 90 95 

Pro Val Gly Thr Phe Leu Val Arg Asp Ser Arg Gin Arg Asn Cys Phe 
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100 105 110 

Phe Ala Leu Ser Val Lys Met Ala Ser Gly Pro Thr Ser lie Arg Val 
115 120 125 

His Phe Gin Ala Gly Arg Phe His Leu Asp Gly Ser Arg Glu Ser Phe 
13a 135 140 

Asp Cys Leu Phe Glu Leu Leu Glu His Tyr Val Ala Ala Pro Arg Arg 
145 150 155 160 

Met Leu Gly Ala Pro Leu Arg Gin Arg Arg Val Arg Pro Leu Gin Glu 
165 170 175 

Leu Cys Arg Gin Arg lie Val Ala Thr Val Gly Arg Glu Asn Leu Ala 
180 185 190 

Arg lie Pro Leu Asn Pro Val Leu Arg Asp Tyr Leu Ser Ser Phe Pro 
195 200 205 

Phe Gin lie 
210 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2807 base pairs 

(B) TYPE: nucleic acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



GG^AACCG^G ,GCGGGGAGAC CA{JGAGGC§:T TGGCCTCAGA GCTTCAGAGT CGCGTGGCAG 
CAAACAGAGA AACCTGTAGA GGGCAGTGTG CGTCACTTAG CTCAGGGAAG CTGCACGCGA 



60 
120 
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AACTCACCCG 


CCTTCATTCA 


TAAACATCGT 


CAGCTAGGCA 


CCTACTCCTG 


GGCTTTCAGG 


180 


ACAAACTGAA 


TCACGAAACC 


ACAGTGTCCT 


TAAAATAGGT 


CTGACCGCCT 


GAATCCCTGG 


240 


CCAAGGTGTG 


TACGGGGCAT 


GGGAGCCCTT 


GTGCAGAGAT 


GCTTGCAGGA 


GCCTTGAGGG 


300 


GCTCTGTAAG 


ACAGAGGCTA 


GGAAGACAAA 


GTTGGGGGCT 


ACAGCTTCTT 


GTCCTGCCCG 


360 


GGGCCTCAGT 


TTCTTCGGTT 


GCCCACGTAG 


GAGTGCAGAG 


AGTCCAGCCC 


CTGGGGACCC 


420 


AACCCAACCC 


CGCCCAGTTT 


CCGAGGAACT 


CGTCCGGGAG 


CGGGGGCGCC 


CCTCCCGCAC 


480 


CGCCTTAGGC 


OTCCTTTGAA 


GCCTCTGCGG 


TCAGGCCACC 


GCTTCCTGGG 


AAGCCCAAGC 


54 0' 


CAAGGCCAGG 


CCGAGTGGCC 


AACGGGAGGG 


GCCCGCGCGC 


GATTCTGGAG 


GAGGGCGGCG 


600 


GCCCCACAGG 


TCTCCAGGGC 


TGGCTAGCCG 


GGCTCCTAGA 


GCGGAGACTG 


CCAAGGCCTT 


660 


CGGGTCCTGG 


GCAGGAAGGA 


TCCTGGCAGG 


GAGGAGTTGC 


TTGGGGGGTG 


GGGGGGAAAG 


720 


GCTCCAGGCG 


CGGTGGAGCT 


CTGACCAGGA 


GAATGCACAC 


ACTCGGAGGG 


GAGGAGGCGT 


780 


GTCAGCCCCA 


AGCTAGCATC 


CCACCCGGGG 


AGCAGCGATG 


TGGGGCGAAG 


GTAGCCAGAG 


840 


CAAAAGAGCA 


GGCACCAGGT 


GACACGAAAC 


AGAAGATTCC 


GGGTAGAGCC 


AGAACCCCAG 


900 


AAGTCCCATT 


CAGGGAAGGT 


GCGAGGCGAG 


AACGAGTTAG 


GTGGACCCTC 


TCCAGGGGCA 


960 


GCCAAAGAAA 


TCTAAAGAGA 


ACCCGAAGGA 


CTTGCCGGAA 


AGAGAAACCG 


AAAGCGGCGG 


1020 


TGGGCGGGAT 


CGGTGGGCGG 


GGCCTCCCTG 


GTTTAAGAGC 


TTGATGCAGG 


GGCGGGCAGC 


1080 


AGCAGAGAGA 


ACTGCGGCCG 


TGGCAGCGGC 


ACGGCTCCCG 


GCCCCGGAGC 


ATGCGCGACA 


1140 


GCAGCCCCGG 


AACCCCCAGC 


CGCGGCGCCC 


CGCGTCCCGC 


CGCCAGGTGA 


GCCGAGGCAG 


1200 


CTGCGAAGGA 


GCAGGCGGGA 


GGGGATGGGA 


GGAAGGGGAG 


CAGAGCCTGG 


CAGGACTATC 


1260 


CTCGCAGACT 


GCATGGCGGG 


GTCGTGGATG 


CTATGCCTCT 


GGCGCCCGCC 


CCACCGGCTG 


1320 


GCCCAGGCGG 


CCCCTCGCGC 


GCGCGGGGCG 


CCGTCAGCCC 


CTCCTCTCCG 


GCCCTGAGCC 


1380 


CGGATCGTCC 


GCCCGGGTTC 


CAGTTCCCGG 


CGTGGCCAGT 


AGGCGGCAAC 


CGCGAGGCGG 


1440 


CAAGCCACCC 


AGCGGGGACG 


GCCTGGAGTC 


GGGCCCCTCT 


CCACGCCCCC 


TTCTCCACGC 


1500 


GCGCGGGGAG 


GCAGGGCTCC 


ACCGCCAGTC 


TGGAAGGGTT 


CCACATACAG 


GAACGGCCTA 


1560 


CTTCGCAGAT 


GAGCCCACCG 


AGGCTCAGGC 


TCCGGGCGGA 


TTCTGCGTGT 


CACCCTCGCT 


1620 


CCTTGGGGTC 


CGCTGGCCGG 


CCTGTGCCAC 


CCGGACGCCC 


GGTTCACTGC 


CTCTGTCTCC 


1680 


CCCATCAGCG 


CAGCCCCGGA 


CGCTATGGCC 


CACCCCTCCA 


GCTGGCCCCT 


CGAGTAGGAT 


1740 


GGTAGCACGT 


AACCAGGTGG 


AAGCCGACAA 


TGCGATCTCC 


CCGGCATCAG 


AGCCCCGACG 


1800 


GCGGCCAGAG 


CCATCCTCGT 


CCTCGTCTTC 


GTCCTCGCCG 


GCGGCCCCGG 


CGCGTCCCCG 


1860 


GCCCTGCCCG 


GTGGTCCCGG 


CCCCGGCTCC 


GGGCGACACT 


CACTTCCGCA 


CCTTCCGCTC 


1920 


CCACTCTGAT 


TACCGGCGCA 


TCACGCGGAC 


CAGCGCTCTC 


CTGGACGCCT 


GCGGCTTCTA 


1980 


CTGGGGACCC 


CTGAGCGTGC 


ATGGGGCGCA 


CGAACGGCTG 


CGTTCCGAAC 


CCGTGGGCAC 


2040 


CTTCTTGGTG 


CGCGACAGTC 


GCCAGCGGAA 


CTGCTTCTTC 


GCGCTCAGCG 


TGAAGATGGC 


2100 


TTCGGGCCCC 


ACGAGCATTC 


GTGTGCACTT 


CCAGGCCGGC 


CGCTTCCACC 


TGGACGGCAA 


2160 


CCGCGAGACC 


TTCGACTGCC 


TCTTCGAGCT 


GCTGGAGCAC 


TACGTGGCGG 


CGCCGCGCCG 


2220 


CAJGTTGGGG 


Q.CCCCACTGC 


GCCAGCGCfG, CGTGCGGCCG 


CTGCAGGAGC 


TGTGTCGCCA 


2280 


GCGCATCGTG 


. GCCGCCGTGG 


GTCGCGAGAA 


CCTGGCACGC 


ATCCCTCTTA 


ACCCGGTACT 


2340 
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CCGTGACTAC CTGAGTTCCT TCCCCTTCCA GATCTGACCG GCTGCCGCCG TGCCCGCAGA 24 0 0 

ATTAAGTGGG AGCGCCTTAT TATTTCTTAT TATTAATTAT TATTATTTTT CTGGAACCAC 24 6 0 

GTGGGAGCCC TCCCCGCCTA GGTCGGAGGG AGTGGGTGTG GAGGGTGAGA TCCCTCCCAC 252 0 

TTCTGGCTGG AGACCTTATC CCGCCTCTCG GGGGGCCTCC CCTCCTGGTG CTCCCTCCCG 2 58 0 

GTCCCCCTGG TTGTAGCAGC TTGTGTCTGG GGCCAGGACC TGAACTCCAC GCCTACCTCT 2 64 0 

CCATGTTTAC ATGTTCCCAG TATCTTTGCA CAAACCAGGG GTGGGGGAGG GTCTCTGGCT 27 0 0 

TCATTTTTCT OCTGTGCAGA ATATTCTATT TTATATTTTT ACATCCAGTT TAGATAATAA 276 0 

ACTTTATTAT GAAAGTTTTT TTTTTTAAAG AAACAAAGAT TTCTAGA 2 8 07 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 212 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Val Ala Arg Asn Gin Val Glu Ala Asp Asn Ala lie Ser Pro Ala 
15 10 15 

Ser Glu Pro Arg Arg Arg Pro Glu Pro Ser Ser Ser Ser Ser Ser Ser 
20 25 30 

Ser Pro Ala Ala Pro Ala Arg Pro Arg Pro Cys Pro Val Val Pro Ala 
35 40 45 

Pro Ala Pro Gly Asp Thr His Phe Arg Thr Phe Arg Ser His Ser Asp 
.50. ..^^ 5§ _ 60 
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Tyr Arg Arg lie Thr Arg Thr Ser Ala Leu Leu Asp Ala Cys Gly Phe 
65 70 75 80 

Tyr Trp Gly Pro Leu Ser Val His Gly Ala His Glu Arg Leu Arg Ser 
85 90 95 

Glu Pro Val Gly Thr Phe Leu Val Arg Asp Ser Arg Gin Arg Asn Cys 
100 105 110 

Phe Phe Ala Leu Ser Val Lys Met Ala Ser Gly Pro Thr Ser lie Arg 
115 120 125 

Val His Phe Gin Ala Gly Arg Phe His Leu Asp Gly Asn Arg Glu Thr 
130 135 140 

Phe Asp Cys Leu Phe Glu Leu Leu Glu His Tyr Val Ala Ala Pro Arg 
145 150 155 160 

Arg Met Leu Gly Ala Pro Leu Arg Gin Arg Arg Val Arg Pro Leu Gin 
165 170 175 

Glu Leu Cys Arg Gin Arg lie Val Ala Ala Val Gly Arg Glu Asn Leu 
180^ 185 190 

Ala Arg lie Pro Leu Asn Pro Val Leu Arg Asp Tyr Leu Ser Ser Phe 
195 200 205 

Pro Phe Gin lie 
210 



DATED this 14th day of February 1997 

THE WALTER AND ELIZA HALL INSTITUTE 
OF MEDICAL RESEARCH 
By^Its Patent Attorneys 
DAVIES COLLISON CAVE 
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FIGURE 3 
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PRM2 PRM3 



Tnp2 



SOCS-I 



1 kbp 



rCCGCACC 



TATTGGGGACrCCTOAGCGTGCACGGGGCGCACGAGCGGCTGCGTGCCGAGCCCGTGGGC 

ACCTTCrTGGTGCGCGACAGTCGTCAA.CGGA„ACrGCT"CTTCGCGCTCAGCGTG.AAGATG 

A S G ? T 3 l' ?. V H F Q A G R F H L D G 
— - GCTTGGGGGCGCACGAGCATCCGCGTC-CACTTCCAGGCCGGCCGCTTCCACTTGGACGGC 

141 S ?. E T F G- C L F E L L- E H Y V A A .P R 
421 AGCCGCGAGACCTTCGACTGCCTTTTCGAGCTGCTGGAGCAcTACGTGGCGGCGCCGCGC 

151 R M 1 G A ? 1 R Q . R R V R ? L Q E L C R 
4 31 CGCATGTTGGGGGCCCcGCTGCGCCAGCGCCGCGTGCGGCCGCTGCAGGAGCHGTGTCGC 

1 S 1 Q R I •/ A A •/ G R E N L A R I ? L N F V ' 
541 C AGCGCATCGTGGCCGCCGTGGGTCGCGAGAACCTGGCGCGCATCCcTCTTAACCCGGTA 



501 



LRDVL33FFFQI* 
CTCCGTGACTACCTGAGTTCCTTCCCCTTCCAGATCtgaccygctgccgccgtgccgcag 



c c J. cai:*:aag::gcgggcgccz*:act:act tctraccacraa" r.ac cac tat t tttccggaacca 

721 cgcgggagccc tccccgcc tgggccggagggagcggctgtggagggtgagatgcctccca 

7 31 ct tctggctggagacctcaccccacctctcaggggtg'ggggtgccccccccctggtgctc 

841 cctccgggcccccccuggctgcagcagcttgcgtctggggccaggacctgaac tccaccc 

901 ctacc -ccccacgt wtacacac tcccagcatccttgcacaaaccaggggtcggggagggt 

951 ccctggcCLcac - wCtgcngtgcagaataccctacttcatat 1 1: ttacagccagtt ta 

1021 ggcaax.a^ct:t:t:at:t:ai:gaaagt:c!:tt:ct c taaaagaaaaaaaaaaaaaaaaaa 
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